Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-10239

upgrade jflex (1.7.0 -> 1.8.2)

Details

    • Task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 9.1
    • None
    • None
    • New

    Description

      When reviewing LUCENE-10238, I noticed we still had unicode 9.0 data specified for our jflex tokenizers.

      According to the changelog I see some key benefits from upgrading to jflex 1.8.2:

      • unicode 9 -> unicode 12.1
      • remove our custom emoji regeneration via ICU, as jflex supports emoji properties directly now.
      • Less RAM at runtime to users (two stage tables): https://github.com/jflex-de/jflex/pull/697

      https://www.jflex.de/changelog.html

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rcmuir Robert Muir
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 2h 10m
                  2h 10m