The Greek and Russian analyzers support custom encodings such as KOI-8, they define things like Lowercase and tokenization for these.
I think that analyzers should support unicode and that conversion/handling of other charsets belongs somewhere else.
I would like to deprecate/remove the support for these other encodings.
|Component/s||modules/analysis [ 12310230 ]|
|Component/s||contrib/analyzers [ 12312333 ]|
|Workflow||Default workflow, editable Closed status [ 12562919 ]||jira [ 12583800 ]|
|Workflow||jira [ 12472761 ]||Default workflow, editable Closed status [ 12562919 ]|
|Status||Resolved [ 5 ]||Closed [ 6 ]|
|Status||Open [ 1 ]||Resolved [ 5 ]|
|Resolution||Fixed [ 1 ]|
|Assignee||Robert Muir [ rcmuir ]|
|Fix Version/s||2.9 [ 12312682 ]|
|Lucene Fields||[New]||[New, Patch Available]|