Description
uax29 word break rules already know how to handle these correctly, we just need to assign them a token type.
This is better than users trying to do this with custom rules (e.g. LUCENE-7916) because they are script-independent (common/inherited).
Attachments
Attachments
Issue Links
- is blocked by
-
LUCENE-8122 upgrade to icu > 60.2
- Reopened
- is related to
-
LUCENE-8527 Upgrade JFlex to 1.7.0
- Resolved