[LUCENE-5415] Support wildcard & co in PostingsHighlighter - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 4.7, 6.0
Component/s: modules/highlighter
Labels:
None

Lucene Fields:

New

Description

PostingsHighlighter uses the offsets encoded in the postings lists for the terms to find query matches.

As such, it isn't really suitable for stuff like wildcards for two reasons:
1. an expensive rewrite against the term dictionary (i think other highlighters share this problem)
2. accumulating data from potentially many terms (e.g. reading many postings)

However, we could provide an option for some of these queries to work, but in a different way, that avoids these downsides.

Instead we can just grab the Automaton representation of the queries, and match it against the content directly (which won't blow up).

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-5415.patch
25/Jan/14 20:59
59 kB
Robert Muir
LUCENE-5415.patch
25/Jan/14 20:08
42 kB
Michael McCandless
LUCENE-5415.patch
25/Jan/14 19:52
38 kB
Robert Muir
LUCENE-5415.patch
25/Jan/14 18:28
23 kB
Robert Muir
LUCENE-5415.patch
24/Jan/14 16:22
13 kB
Robert Muir

Activity

People

Assignee:: Unassigned

Reporter:: Robert Muir

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 24/Jan/14 16:21

Updated:: 15/Sep/24 22:23

Resolved:: 26/Jan/14 05:07