Details
- Type: Task
- Status: Closed
- Priority: Major
- Resolution: Fixed
- Lucene Fields: New
Description
While reviewing LUCENE-10238, I noticed that our JFlex tokenizers still specify Unicode 9.0 data.
According to the changelog, upgrading to JFlex 1.8.2 would bring some key benefits:
- Unicode 9 -> Unicode 12.1
- We can remove our custom emoji regeneration via ICU, as JFlex now supports emoji properties directly.
- Lower RAM usage at runtime for users (two-stage tables): https://github.com/jflex-de/jflex/pull/697
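
To illustrate why two-stage tables can reduce memory, here is a minimal sketch of the general technique. It is not JFlex's generated code: the function names, the 256-entry block size, and the toy character classes are all assumptions made for illustration. The idea is that a flat per-code-point table is cut into fixed-size blocks, identical blocks are stored only once, and a small first-stage index maps each block position to its shared copy.

```python
# Illustrative two-stage lookup table (hypothetical names, not JFlex's API).
BLOCK_BITS = 8                 # assumed block size: 256 entries per block
BLOCK_SIZE = 1 << BLOCK_BITS


def build_two_stage(flat):
    """Split a flat table into (stage1, blocks), storing identical blocks once."""
    stage1 = []   # one entry per block position: index into `blocks`
    blocks = []   # the distinct blocks, each a tuple of BLOCK_SIZE values
    seen = {}     # block contents -> index in `blocks`
    for start in range(0, len(flat), BLOCK_SIZE):
        block = tuple(flat[start:start + BLOCK_SIZE])
        if block not in seen:
            seen[block] = len(blocks)
            blocks.append(block)
        stage1.append(seen[block])
    return stage1, blocks


def lookup(stage1, blocks, cp):
    """Return the table value for code point `cp` using two indexed reads."""
    return blocks[stage1[cp >> BLOCK_BITS]][cp & (BLOCK_SIZE - 1)]


# Toy data: class 1 for ASCII lowercase letters, class 0 for everything else.
flat = [0] * 0x110000
for cp in range(ord("a"), ord("z") + 1):
    flat[cp] = 1

stage1, blocks = build_two_stage(flat)
```

In this toy case the 0x110000-entry flat table collapses to a first-stage index plus only the distinct blocks, because most of the code point range shares one all-zero block. Real character-class tables are less uniform, so actual savings depend on the data.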