Details
Bug
Status: Open
Trivial
Resolution: Unresolved
2.9.4
None
None
New, Patch Available
Description
When the Highlighter is combined with an N-gram tokenizer such as CJKTokenizer, highlighting a phrase that appears around the 50th term in the field produces a highlighted phrase that is shorter than expected.
e.g. Highlighting "fooo" in the following text with a bigram tokenizer:
"0---------1---------2---------3---------4---------fooo---"
Expected: "0---------1---------2---------3---------4---------<B>fooo</B>---"
Actual: "0---------1---------2---------3---------4---------f<B>ooo</B>---"
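To make the expected result concrete, here is a minimal sketch in plain Python of how a bigram tokenizer splits the sample text and which character span should be highlighted. It is not Lucene's Highlighter or CJKTokenizer, and it does not reproduce the bug. The helper names `bigrams` and `highlight_phrase` are hypothetical, and the model assumes simple overlapping character bigrams with start and end offsets.

```python
def bigrams(text):
    """Return (term, start_offset, end_offset) for each overlapping character bigram."""
    return [(text[i:i + 2], i, i + 2) for i in range(len(text) - 1)]


def highlight_phrase(text, phrase, pre="<B>", post="</B>"):
    """Wrap the first run of consecutive bigrams that spells out `phrase`."""
    tokens = bigrams(text)
    wanted = [term for term, _, _ in bigrams(phrase)]
    n = len(wanted)
    for i in range(len(tokens) - n + 1):
        if [term for term, _, _ in tokens[i:i + n]] == wanted:
            start = tokens[i][1]        # start offset of the first matching bigram
            end = tokens[i + n - 1][2]  # end offset of the last matching bigram
            return text[:start] + pre + text[start:end] + post + text[end:]
    return text  # phrase not found: leave the text unchanged


text = "0---------1---------2---------3---------4---------fooo---"
print(highlight_phrase(text, "fooo"))
```

In this model the phrase "fooo" becomes the bigrams "fo", "oo", "oo", and the first of them starts at character offset 50, so a correct highlight has to begin at the start offset of "fo". The Actual output in the report begins one character later.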