Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
New
Description
SynonymFilter has two limitations today:
- It cannot create positions, so eg dns -> domain name service
creates blatantly wrong highlights (SOLR-3390,LUCENE-4499and
others).
- It cannot consume a graph, so e.g. if you try to apply synonyms
after Kuromoji tokenizer I'm not sure what will happen.
I've thought about how to fix these issues but it's really quite
difficult with the current PosInc/PosLen graph representation, so I'd
like to explore an alternative approach.