[LUCENE-4198] Allow codecs to index term impacts - ASF JIRA

Details

Type: Sub-task
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 8.0
Component/s: core/index
Labels: None
Lucene Fields: New

Description

Subtask of ~~LUCENE-4100~~.

That's an example of something similar to impact indexing (his implementation currently stores a max for the entire term, but the problem is the same).
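To make the idea concrete, here is a minimal sketch of a per-term score upper bound. Everything in it is hypothetical and is not Lucene API: the class name, the `score`, `maxScore` and `canSkip` helpers, and the simplified BM25-style formula (no length normalization) are assumptions chosen only for illustration.

```java
/** Hypothetical sketch, not Lucene API: one score upper bound stored per term. */
public class TermMaxScoreSketch {

    /** Simplified BM25-style saturation without length normalization (illustrative only). */
    static double score(double idf, int freq, double k1) {
        return idf * (freq * (k1 + 1)) / (freq + k1);
    }

    /** Highest score any posting of this term can produce: a single per-term "impact". */
    static double maxScore(double idf, int[] freqs, double k1) {
        double max = 0;
        for (int freq : freqs) {
            max = Math.max(max, score(idf, freq, k1));
        }
        return max;
    }

    /** A term can be skipped when even its best document cannot reach the threshold. */
    static boolean canSkip(double termMaxScore, double minCompetitiveScore) {
        return termMaxScore < minCompetitiveScore;
    }

    public static void main(String[] args) {
        int[] freqs = {1, 5, 2}; // term frequencies of the documents containing the term
        double max = maxScore(1.0, freqs, 1.2);
        System.out.println("max score for term: " + max);
        System.out.println("skippable at threshold 2.5: " + canSkip(max, 2.5));
    }
}
```

The bound is only useful if it is available at search time without scanning the postings, which is why it would have to be written into the index.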

We can imagine other similar algorithms too: I think the codec API should be able to support these.

Currently it really doesn't: Stefan worked around the problem by providing a tool to 'rewrite' your index, to which he passes the IndexReader and Similarity. But it would be better if we fixed the codec API.

One problem is that the Postings writer needs to have access to the Similarity. Another problem is that it needs access to the term and collection statistics up front, rather than after the fact.
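The two requirements can be sketched as follows. This is a hypothetical illustration, not the codec API: `Scorer` stands in for a Similarity, and `ImpactWriter` with its `startTerm`, `addPosting` and `finishTerm` methods is an invented name for a postings writer that records a per-term max score while streaming postings.

```java
/** Hypothetical sketch, not Lucene API: a writer that needs the scorer and statistics up front. */
public class UpFrontStatsSketch {

    /** Stand-in for a Similarity: scores one term frequency given term and collection statistics. */
    interface Scorer {
        double score(int freq, int docFreq, int docCount);
    }

    /** Streams postings for one term at a time and tracks the best score seen. */
    static final class ImpactWriter {
        private final Scorer scorer;   // requirement 1: the writer must see the scoring function
        private final int docCount;    // collection statistic, fixed before writing starts
        private int docFreq;
        private double maxScore;

        ImpactWriter(Scorer scorer, int docCount) {
            this.scorer = scorer;
            this.docCount = docCount;
        }

        /** Requirement 2: docFreq must be supplied before the first posting, not after the last. */
        void startTerm(int docFreq) {
            this.docFreq = docFreq;
            this.maxScore = 0;
        }

        /** Scores each posting as it is written; impossible without the up-front statistics. */
        void addPosting(int freq) {
            maxScore = Math.max(maxScore, scorer.score(freq, docFreq, docCount));
        }

        /** Returns the per-term upper bound that would be stored alongside the postings. */
        double finishTerm() {
            return maxScore;
        }
    }

    public static void main(String[] args) {
        // Toy scoring function, chosen only so the arithmetic is easy to follow.
        Scorer toy = (freq, docFreq, docCount) -> freq * (double) docCount / docFreq;
        ImpactWriter writer = new ImpactWriter(toy, 10);
        writer.startTerm(2);
        writer.addPosting(3);
        writer.addPosting(1);
        System.out.println("per-term max score: " + writer.finishTerm());
    }
}
```

If the statistics only arrived after the postings were written, the writer would have to buffer every posting or make a second pass, which is presumably the cost the experiment needs to measure.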

This might have some cost (hopefully minimal), so I'm thinking of experimenting with these changes in a branch to see if we can make it work well.

Attachments

- LUCENE-4198_flush.patch, 05/Jul/12 20:36, 11 kB, Robert Muir
- LUCENE-4198.patch, 03/Jan/18 15:50, 252 kB, Adrien Grand
- LUCENE-4198.patch, 04/Jan/18 19:14, 101 kB, Adrien Grand
- LUCENE-4198.patch, 05/Jan/18 14:23, 143 kB, Adrien Grand
- LUCENE-4198.patch, 12/Jan/18 12:11, 180 kB, Adrien Grand
- LUCENE-4198-BMW.patch, 12/Jan/18 18:52, 65 kB, Adrien Grand
- LUCENE-4198.patch, 19/Jan/18 14:42, 181 kB, Adrien Grand
- TestSimpleTextPostingsFormat.sarowe.jenkins.nightly.master.681.consoleText.excerpt.txt, 02/Feb/18 01:58, 108 kB, Steven Rowe
- TestSimpleTextPostingsFormat.asf.nightly.master.1466.consoleText.excerpt.txt, 02/Feb/18 01:58, 108 kB, Steven Rowe

Issue Links

supersedes

- LUCENE-8087 Record per-term max term frequencies (Resolved)
- LUCENE-8083 Give similarities better values for maxScore (Resolved)

links to

- GitHub Pull Request #115

People

Assignee: Unassigned

Reporter: Robert Muir

Votes: 0

Watchers: 8

Dates

Created: 05/Jul/12 20:34

Updated: 28/Aug/22 13:21

Resolved: 31/Jan/18 13:48
