Lucene - Core
LUCENE-5354

Blended score in AnalyzingInfixSuggester

    Details

    • Lucene Fields:
      New

      Description

      I'm working on a custom suggester derived from AnalyzingInfixSuggester. I need what is called a "blended score" (see the //TODO at line 399 in AnalyzingInfixSuggester) to transform the suggestion weights depending on the position of the searched term(s) in the text.

      Right now, I'm using an easy solution:
      If I want 10 suggestions, I search the current ordered index for the first 100 results and transform each weight:

      a) by using the term position in the text (found with TermVector and DocsAndPositionsEnum)

      or

      b) by multiplying the weight by the score of a SpanQuery that I add when searching

      and then return the 10 suggestions with the highest updated weights.

      Since we usually don't need to suggest that many results, the overhead of the larger search plus rescoring is not significant, but I agree that this is not the most elegant solution.
      We could instead include this factor (here, the position of the term) directly in the index.
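
      A minimal sketch of this oversample-and-rescore approach, using the reciprocal blend to stand in for either strategy; the helper firstPositionOfQueryTerm and the surrounding names are hypothetical, not code from any patch:

      import java.io.IOException;
      import java.util.ArrayList;
      import java.util.Collections;
      import java.util.Comparator;
      import java.util.List;

      import org.apache.lucene.search.suggest.Lookup;

      // Oversample by 10x, rescale each weight by a position-based coefficient,
      // then keep only the `num` suggestions with the highest blended weights.
      List<Lookup.LookupResult> blendedLookup(Lookup suggester, CharSequence key, int num)
          throws IOException {
        List<Lookup.LookupResult> raw = suggester.lookup(key, false, num * 10);
        List<Lookup.LookupResult> rescored = new ArrayList<Lookup.LookupResult>();
        for (Lookup.LookupResult r : raw) {
          int position = firstPositionOfQueryTerm(r.key); // hypothetical: found via TermVector
          long blended = (long) (r.value * (1.0 / (1 + position))); // reciprocal blend
          rescored.add(new Lookup.LookupResult(r.key, blended));
        }
        Collections.sort(rescored, new Comparator<Lookup.LookupResult>() {
          @Override
          public int compare(Lookup.LookupResult a, Lookup.LookupResult b) {
            return Long.compare(b.value, a.value); // highest blended weight first
          }
        });
        return rescored.subList(0, Math.min(num, rescored.size()));
      }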

      So, I can contribute this if you think it's worth adding.

      Do you think I should tweak AnalyzingInfixSuggester, subclass it, or create a dedicated class?

      1. LUCENE-5354_2.patch
        22 kB
        Remi Melisson
      2. LUCENE-5354_3.patch
        23 kB
        Remi Melisson
      3. LUCENE-5354_4.patch
        2 kB
        Remi Melisson
      4. LUCENE-5354.patch
        23 kB
        Remi Melisson

        Activity

        Michael McCandless added a comment -

        This sounds very useful!

        I think a subclass could work well, if we open up the necessary methods (which Query to run, how to do the search / resort the results)?

        We could make the index-time sorting optional as well? This way you'd build an "ordinary" index, run an "ordinary" query, so you have full flexibility (but at more search-time cost).

        Remi Melisson added a comment -

        I attached a first patch which adds a blended score based on the position of the searched term. It only provides strategy (a), with two options:

        • Linear (-10% for each position): blended_weight = weight * (1 - 0.10 * position)
        • Reciprocal: blended_weight = weight / (1 + position)

        I would also like to add the second strategy (b) by directly using the score, but here is a first attempt.
        Any advice/remarks welcome!

        Michael McCandless added a comment -

        Thanks Remi, patch looks great!

        Can you move that boolean finished inside the if (lastToken != null)? (If there was no lastToken then we should not be calling offsetEnd.endOffset).

        Can we leave AnalyzingInfixSuggester with DOCS_ONLY? I.e., open up a method (maybe getTextFieldType?) that the subclass would override and set to DOCS_AND_FREQS_AND_POSITIONS.
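
        A minimal sketch of that split, assuming the proposed getTextFieldType hook; the subclass name is illustrative, and the constructor mirrors the existing AnalyzingInfixSuggester(Version, File, Analyzer) one:

        import java.io.File;
        import java.io.IOException;

        import org.apache.lucene.analysis.Analyzer;
        import org.apache.lucene.document.FieldType;
        import org.apache.lucene.document.TextField;
        import org.apache.lucene.index.FieldInfo.IndexOptions;
        import org.apache.lucene.search.suggest.analyzing.AnalyzingInfixSuggester;
        import org.apache.lucene.util.Version;

        public class PositionBlendedSuggester extends AnalyzingInfixSuggester {

          public PositionBlendedSuggester(Version matchVersion, File indexPath, Analyzer analyzer)
              throws IOException {
            super(matchVersion, indexPath, analyzer);
          }

          // The base class stays DOCS_ONLY; this subclass asks for positions (and
          // term vectors) so the blended score can see where each term occurs.
          @Override
          protected FieldType getTextFieldType() {
            FieldType ft = new FieldType(TextField.TYPE_NOT_STORED);
            ft.setIndexOptions(IndexOptions.DOCS_AND_FREQS_AND_POSITIONS);
            ft.setStoreTermVectors(true);
            ft.setStoreTermVectorPositions(true);
            return ft;
          }
        }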

        In createCoefficient, instead of splitting the incoming key on space, I think you should ask the analyzer to do so? In fact, since the lookup (in super) already did that (break into tokens, figure out if last token is a "prefix" or not), maybe we can just pass that down to createResult?

        If the query has more than one term, it looks like you only use the first? Maybe instead we should visit all the terms and record which one has the lowest position?

        Have you done any performance testing? Visiting term vectors for each hit can be costly. It should be more performant to pull a DocsAndPositionsEnum up front and then do .advance to each (sorted) docID to get the position ... but this is likely more complex (it inverts the "stride", so you'd do term by term on the outer loop, then docs on the inner loop, vs the opposite that you have now).
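
        A rough sketch of that inverted stride against the 4.x postings API (the method and variable names are illustrative):

        import java.io.IOException;
        import java.util.Arrays;

        import org.apache.lucene.index.AtomicReader;
        import org.apache.lucene.index.DocsAndPositionsEnum;
        import org.apache.lucene.index.Terms;
        import org.apache.lucene.index.TermsEnum;
        import org.apache.lucene.search.DocIdSetIterator;
        import org.apache.lucene.util.BytesRef;

        // For one query term, walk its postings once and advance() through the
        // docID-sorted top hits, recording the first position in each match.
        static int[] firstPositions(AtomicReader reader, String field, BytesRef term,
                                    int[] sortedDocIDs) throws IOException {
          int[] positions = new int[sortedDocIDs.length];
          Arrays.fill(positions, -1); // -1 = term absent from this hit
          Terms terms = reader.terms(field);
          if (terms == null) return positions;
          TermsEnum termsEnum = terms.iterator(null);
          if (!termsEnum.seekExact(term)) return positions;
          DocsAndPositionsEnum postings = termsEnum.docsAndPositions(null, null);
          if (postings == null) return positions; // field indexed without positions
          int doc = -1;
          for (int i = 0; i < sortedDocIDs.length; i++) {
            if (doc < sortedDocIDs[i]) {
              doc = postings.advance(sortedDocIDs[i]);
              if (doc == DocIdSetIterator.NO_MORE_DOCS) break;
            }
            if (doc == sortedDocIDs[i]) {
              positions[i] = postings.nextPosition(); // smallest position in this doc
            }
          }
          return positions;
        }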

        key.toString() can be pulled out of the while loop and done once up front.

        Why do you use key.toString().contains(docTerm) for the finished case? Won't that result in false positives, e.g. if key is "foobar" and docTerm is "oba"?

        Can you rewrite the embedded ternary operator in the LookUpComparator to just use simple if statements? I think that's more readable...

        Remi Melisson added a comment -

        Hey Michael, thanks for the in-depth code review!
        I attached another patch which makes things simpler and fixes what you suggested.

        The remaining items are:

        Have you done any performance testing?

        Not really. I saw that you did some for the infix suggester, but I couldn't find the code. Is there something already, or should I test the performance my own way?

        Visiting term vectors for each hit can be costly. It should be more performant to pull a DocsAndPositionsEnum up front and then do .advance to each (sorted) docID to get the position ... but this is likely more complex (it inverts the "stride", so you'd do term by term on the outer loop, then docs on the inner loop, vs the opposite that you have now).

        For now, the only way I know to access the DocsAndPositionsEnum is by getting it from the TermsEnum, which implies iterating over the term vector (the doc says "Get DocsAndPositionsEnum for the current term").

        Remi Melisson added a comment -

        Hi, any news about this feature?
        Could I do anything else?

        Michael McCandless added a comment -

        Woops, sorry, this fell below the event horizon of my TODO list. I'll look at your new patch soon.

        There is an existing performance test, LookupBenchmarkTest, but it's a bit tricky to run. See the comment on LUCENE-5030: https://issues.apache.org/jira/browse/LUCENE-5030?focusedCommentId=13689155&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13689155

        Michael McCandless added a comment -

        New patch looks great, thanks Remi!

        I'm worried about how costly iterating over term vectors is going to be ... are you planning to run the performance test? If not, I can.

        It might be better to open up a protected method to convert the smallest position to the coefficient? The default impl can do the switch based on the BlenderType enum... but apps may want to control how the score is "boosted" by position.
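
        A minimal sketch of such a hook, reusing the two formulas from the earlier patch (the enum and method names are only illustrative):

        // The two blending strategies from the patch, behind an overridable hook;
        // apps wanting a different position-to-boost mapping would override it.
        enum BlenderType { POSITION_LINEAR, POSITION_RECIPROCAL }

        protected double calculateCoefficient(BlenderType type, int position) {
          switch (type) {
            case POSITION_LINEAR:
              return 1 - 0.10 * position; // -10% per position (goes negative past position 9; clamping is an option)
            case POSITION_RECIPROCAL:
              return 1.0 / (1 + position); // weight / (1 + position)
            default:
              throw new IllegalArgumentException("unknown blender type: " + type);
          }
        }

        // Blended weight: long blended = (long) (weight * calculateCoefficient(type, position));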

        Remi Melisson added a comment - edited

        Hi!
        Here is a new patch including your comment about the coefficient calculation (I guess a lambda function would be perfect here!).

        I ran the performance test on my laptop; here are the results compared to AnalyzingInfixSuggester:
        – construction time
        AnalyzingInfixSuggester input: 50001, time[ms]: 1780 [+- 367.58]
        BlendedInfixSuggester input: 50001, time[ms]: 6507 [+- 2106.52]
        – prefixes: 2-4, num: 7, onlyMorePopular: false
        AnalyzingInfixSuggester queries: 50001, time[ms]: 6804 [+- 1403.13], ~kQPS: 7
        BlendedInfixSuggester queries: 50001, time[ms]: 26503 [+- 2624.41], ~kQPS: 2
        – prefixes: 6-9, num: 7, onlyMorePopular: false
        AnalyzingInfixSuggester queries: 50001, time[ms]: 3995 [+- 551.20], ~kQPS: 13
        BlendedInfixSuggester queries: 50001, time[ms]: 5355 [+- 1295.41], ~kQPS: 9
        – prefixes: 100-200, num: 7, onlyMorePopular: false
        AnalyzingInfixSuggester queries: 50001, time[ms]: 2626 [+- 588.43], ~kQPS: 19
        BlendedInfixSuggester queries: 50001, time[ms]: 1980 [+- 574.16], ~kQPS: 25
        – RAM consumption
        AnalyzingInfixSuggester size[B]: 1,430,920
        BlendedInfixSuggester size[B]: 1,630,488

        If you have any ideas on how we could improve the performance, let me know (see my comment above about your earlier suggestion to avoid visiting term vectors).

        Michael McCandless added a comment -

        Thanks Remi, the performance seems fine? But I realized this is not the best benchmark, since all suggestions are just a single token.

        New patch looks great; I think we should commit this approach, and performance improvements can come later if necessary.

        see above my comment for your previous suggestion to avoid visiting term vectors

        Oh, the idea I had was to not use term vectors at all: you can get a TermsEnum for the normal inverted index, and then visit each term from the query, and then .advance to each doc from the top N results. But we can do this later ... I'll commit this patch (I'll make some small code style improvements, e.g. adding { } around all ifs).

        Michael McCandless added a comment -

        Thanks Remi!

        I committed with the wrong issue LUCENE-5345 by accident...

        Remi Melisson added a comment -

        Great, glad to contribute!
        In terms of performance, I'm using it on my laptop with 30K terms, and the mean lookup time is 5ms for 5 results and 45ms for 50 results (with a factor of 10, i.e. I retrieve 50 / 500 items and then reduce to 5 / 50). I'm not following a proper testing methodology, so this is just roughly what I observed.
        I will do more extensive performance testing, and yeah, we can tackle that later on.

        Remi Melisson added a comment - edited

        Woops, I introduced a bug when refactoring the comparator.
        I submitted another patch to fix this. I also updated the test case accordingly.

        Michael McCandless added a comment -

        OK, no problem, I'll have a look! Thanks Remi.

        ASF subversion and git services added a comment -

        Commit 1558100 from Michael McCandless in branch 'dev/trunk'
        [ https://svn.apache.org/r1558100 ]

        LUCENE-5354: BlendedInfixSuggester: fix wrong return (0 instead of -1) from the LookupResult comparator

        ASF subversion and git services added a comment -

        Commit 1558102 from Michael McCandless in branch 'dev/branches/branch_4x'
        [ https://svn.apache.org/r1558102 ]

        LUCENE-5354: BlendedInfixSuggester: fix wrong return (0 instead of -1) from the LookupResult comparator


          People

          • Assignee:
            Unassigned
          • Reporter:
            Remi Melisson
          • Votes: 1
          • Watchers: 4