Details
Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
4.1
None
Description
Using the HTMLStripCharFilter may lead to an invalid HTML highlight.
The HTMLStripCharFilter has a special treatment of inline elements (e.g. "a", "b", ...). For these elements the CharFilter ignores the tag and does not insert any split character.
If you index
"""
<a>xxx</a>
"""
you get the word "xxx" with start offset 3 and end offset 10, so the reported span also covers the closing "</a>" tag.
If you highlight a search on "xxx", you will get
"""
<a><em>xxx</a></em>
"""
which is invalid HTML.
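The effect can be reproduced without Lucene. The following is only a minimal sketch in Python, assuming a highlighter that inserts its tags purely by character offsets; the `highlight` helper and the "tight" offsets (3, 6) are hypothetical and are not part of the Lucene API. It shows that the offsets reported above produce the misnested markup, while offsets covering only the visible text would not.

```python
def highlight(text: str, start: int, end: int,
              pre: str = "<em>", post: str = "</em>") -> str:
    """Wrap text[start:end] in pre/post tags, the way an offset-based
    highlighter would (hypothetical helper, not Lucene code)."""
    return text[:start] + pre + text[start:end] + post + text[end:]


source = "<a>xxx</a>"

# Offsets reported in this issue for the token "xxx": start 3, end 10.
reported = highlight(source, 3, 10)

# Assumed alternative: offsets that cover only the visible text "xxx".
tight = highlight(source, 3, 6)

print(reported)  # <a><em>xxx</a></em>
print(tight)     # <a><em>xxx</em></a>
```

With the reported offsets the closing "</a>" ends up inside the highlighted span, so "</em>" is emitted after it and the tags overlap instead of nesting.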