Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-6760

TestUtil#randomWhitespace() broken for Java 9 (Unicode 7)

    Details

    • Type: Test
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 5.3.1, 5.4, 6.0
    • Component/s: None
    • Labels:
    • Environment:

      Java 9 build 76+

    • Lucene Fields:
      New

      Description

      Java 9 changed its character tables to unicode version 7.0. The table updates are listed here: http://hg.openjdk.java.net/jdk9/jdk9/jdk/rev/86206517258d

      Because of this, character \u180E is no longer treated as whitespace, so TestUtil#randomWhitespace() fails.

      I will remove this character from the list and update the documentation JRE_VERSION_MIGRATION.txt.

      We should maybe make this character list dynamic (e.g. TestUtil initializes it on static class init one time by iterating over all 16 bit characters). Maybe somebody else has an idea (there is already a TODO in the code about that).

        Attachments

        1. LUCENE-6760.patch
          2 kB
          Uwe Schindler
        2. LUCENE-6760.patch
          2 kB
          Uwe Schindler
        3. LUCENE-6760.patch
          2 kB
          Uwe Schindler
        4. LUCENE-6760.patch
          1 kB
          Uwe Schindler

          Issue Links

            Activity

              People

              • Assignee:
                thetaphi Uwe Schindler
                Reporter:
                thetaphi Uwe Schindler
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: