Details
Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Lucene Fields: New
Description
The PhraseHelper of the UnifiedHighlighter currently collects position-spans per SpanQuery (and it knows which terms are in which SpanQuery), and then it filters each PostingsEnum based on that. This is similar to how the original Highlighter's WeightedSpanTermExtractor (WSTE) works. The main problem with this approach is that it can be inaccurate for some nested span queries: see LUCENE-2287, LUCENE-5455 (has the clearest example), and LUCENE-6796. Non-nested SpanQueries (e.g. those converted from a PhraseQuery or MultiPhraseQuery) are not a problem.
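To make the inaccuracy concrete, here is a minimal, Lucene-free sketch of span-based position filtering. Everything in it is an assumption made for illustration: the class `SpanFilterSketch`, the method `filterBySpans`, the sample text, and the nested query are hypothetical and are not taken from PhraseHelper or from the linked issues. It assumes a nested query along the lines of spanNear(spanNear(a, b), c) over the text "a b a c", for which the top-level span covers positions [0, 4).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch; not the actual PhraseHelper code.
class SpanFilterSketch {

    // Keeps every position whose token is a query term and that falls inside
    // at least one half-open span {start, end} collected for the query.
    static List<Integer> filterBySpans(String[] tokens, Set<String> queryTerms, List<int[]> spans) {
        List<Integer> kept = new ArrayList<>();
        for (int pos = 0; pos < tokens.length; pos++) {
            if (!queryTerms.contains(tokens[pos])) {
                continue; // not a term of this SpanQuery
            }
            for (int[] span : spans) {
                if (pos >= span[0] && pos < span[1]) {
                    kept.add(pos);
                    break; // one enclosing span is enough
                }
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Text "a b a c": a=0, b=1, a=2, c=3.
        String[] tokens = {"a", "b", "a", "c"};
        Set<String> terms = new HashSet<>(Arrays.asList("a", "b", "c"));
        // Assumed top-level span for the nested query: inner "a b" at [0, 2),
        // followed by "c" at position 3, giving an outer span of [0, 4).
        List<int[]> spans = new ArrayList<>();
        spans.add(new int[] {0, 4});
        System.out.println(filterBySpans(tokens, terms, spans)); // prints [0, 1, 2, 3]
    }
}
```

Under these assumptions the actual match consists of positions 0, 1 and 3, yet the second "a" at position 2 is also kept, because the filter only knows the outer span and the set of terms, not which occurrences participated in the nested match. With a non-nested query the span and the participating terms coincide, so this kind of over-highlighting does not arise in this sketch.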