Details
-
Improvement
-
Status: Reopened
-
Minor
-
Resolution: Fixed
-
9.0
-
None
-
New
Description
Follow-up issue of LUCENE-9413.
We now have CJKWidthCharFilter in analyzers-common. I believe in many situations it is recommended applying half-width/full-width character normalization before tokenization for consistency in analysis.
The change slightly affects on the analyzer's outputs. We can provide a parameter to switch back to CJKWidthFilter for backward compatibility.
Attachments
Issue Links
- is related to
-
LUCENE-9413 Add a char filter corresponding to CJKWidthFilter
- Resolved
- links to