Lucene - Core
LUCENE-2690

Do MultiTermQuery boolean rewrites per segment

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 4.0-ALPHA
    • Fix Version/s: 4.0-ALPHA
    • Component/s: None
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      MultiTermQuery currently rewrites FuzzyQuery (using TopTermsBooleanQueryRewrite), the constant-score auto rewrite method, and the scoring BooleanQuery rewrite methods using a MultiFields wrapper on the top-level reader. This is inefficient.

      This patch changes the rewrite modes to do the rewrites per segment and uses some additional data structures (hashed sets/maps) to exclude duplicate terms. All tests currently pass, but FuzzyQuery's tests should not, because its minimum-score handling depends on the terms being collected in order.

      Robert will fix FuzzyQuery in this issue, too. This patch is just a start.
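      The per-segment rewrite with duplicate exclusion described above can be sketched as follows. This is only a simplified illustration, not the actual patch: plain Strings and a HashSet stand in for Lucene's BytesRef terms and the hashed sets/maps the patch uses.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Simplified illustration of the per-segment rewrite with duplicate
// exclusion: visit each segment's matching terms separately and use a
// hashed set to skip terms already collected from an earlier segment.
class PerSegmentRewriteSketch {
    static List<String> collectTerms(List<List<String>> perSegmentMatches) {
        Set<String> seen = new HashSet<>();       // excludes duplicates across segments
        List<String> collected = new ArrayList<>();
        for (List<String> segmentTerms : perSegmentMatches) { // one pass per segment
            for (String term : segmentTerms) {
                if (seen.add(term)) {             // true only on first occurrence
                    collected.add(term);
                }
            }
        }
        return collected;
    }
}
```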

      1. LUCENE-2690.patch
        62 kB
        Uwe Schindler
      2. LUCENE-2690.patch
        56 kB
        Uwe Schindler
      3. LUCENE-2690.patch
        49 kB
        Uwe Schindler
      4. LUCENE-2690.patch
        46 kB
        Uwe Schindler
      5. LUCENE-2690.patch
        46 kB
        Uwe Schindler
      6. LUCENE-2690.patch
        46 kB
        Uwe Schindler
      7. LUCENE-2690.patch
        45 kB
        Uwe Schindler
      8. LUCENE-2690.patch
        39 kB
        Robert Muir
      9. LUCENE-2690-attributes.patch
        38 kB
        Uwe Schindler
      10. LUCENE-2690-attributes.patch
        50 kB
        Uwe Schindler
      11. LUCENE-2690-attributes.patch
        50 kB
        Uwe Schindler
      12. LUCENE-2690.patch
        27 kB
        Uwe Schindler
      13. LUCENE-2690.patch
        26 kB
        Simon Willnauer
      14. LUCENE-2690.patch
        26 kB
        Uwe Schindler
      15. LUCENE-2690.patch
        26 kB
        Uwe Schindler
      16. LUCENE-2690.patch
        26 kB
        Simon Willnauer
      17. LUCENE-2690-hack.patch
        23 kB
        Michael McCandless
      18. LUCENE-2690.patch
        19 kB
        Uwe Schindler
      19. LUCENE-2690.patch
        16 kB
        Michael McCandless
      20. LUCENE-2690.patch
        17 kB
        Robert Muir
      21. LUCENE-2690.patch
        13 kB
        Uwe Schindler
      22. LUCENE-2690.patch
        12 kB
        Uwe Schindler

        Issue Links

          Activity

          Uwe Schindler added a comment -

          Updated patch that also checks for duplicate terms in the fuzzy rewrite. This should be fine now, but we need to fix the FuzzyQuery tests to check for multiple segments with the same terms, which should fail with this patch.

          Maybe we need a separate MTQ test that creates two IndexWriters which add documents with an overlapping term set to both indexes. Queries are then run using MultiReader, so we can control merging and make sure the term really appears in two "segments". I will work on a test for that.

          Michael McCandless added a comment -

          It'd be nice somehow to have MTQ.getTotalNumberOfTerms return the unique term count instead of the total number of terms visited across all segments...

          Michael McCandless added a comment -

          We have to sort the terms coming out of the BytesRefHash, else we get bad seek performance, because the within-block seek optimization will otherwise often fail to apply...

          So I used a TreeMap instead of a HashMap.

          Then ran a quick perf test on 10 M Wikipedia index:

          Query    QPS clean  QPS mtqseg  Pct diff
          unit*        11.83       11.80     -0.3%
          un*d         13.64       16.95     24.3%
          u*d           2.67        3.77     41.1%
          un*ed        34.85       74.94    115.0%
          uni*ed      183.37      437.13    138.4%

          So these are good gains! I can't run FuzzyQuery until we fix the tie-break problem...

          I'm really not sure why the prefix query sees no gain yet the others do (I would have actually expected the reverse, because PrefixTermsEnum's accept method is so simple).
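          The TreeMap change above can be illustrated with a minimal sketch: collected terms are handed back to the seek loop in sorted order, so consecutive dictionary seeks move forward within the same block. TreeSet here is just a stand-in for iterating a TreeMap's keys.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import java.util.TreeSet;

// Why sorting matters for the seek pattern: natural (lexicographic)
// order matches the on-disk order of the terms dictionary, so the
// within-block seek optimization can apply on consecutive seeks.
class SortedSeekSketch {
    static List<String> seekOrder(Collection<String> collectedTerms) {
        // TreeSet yields the terms in sorted order, like a TreeMap's keys
        return new ArrayList<>(new TreeSet<>(collectedTerms));
    }
}
```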

          Robert Muir added a comment -

          We fixed some bugs in the patch; it is not yet ready for committing, but the tests now pass.

          Michael McCandless added a comment -

          Another iteration... same perf as before but more failures!

          Uwe Schindler added a comment -

          Same as Mike's patch, only with some nocommits removed (max clause count increased) and added the missing FloatUtils file.

          Uwe Schindler added a comment -

          By the way: no more failures - only speed improvements (mostly)! It seems that TestFuzzyQuery2 failed because of the incorrectly increased max clause count!

          The last thing to do is fixing the attribute stuff to separate the two attribute parts.

          Simon Willnauer added a comment -

          Guys, awesome improvements!! Here are some comments...

          • In CutOffTermCollector:
             final BytesRefHash pendingTerms = new BytesRefHash(new ByteBlockPool(new RecyclingByteBlockAllocator()));

            Since we do not reuse the allocator, we don't need to use the synced one here. There is also no reset call anywhere to free the allocated blocks. We should just use new BytesRefHash() here.

          • BooleanQueryRewrite#rewrite uses a HashMap to keep track of BytesRef and TermFreqBoost. I wonder if we should make use of the ParallelArray technique we use in the indexing chain, together with a BytesRefHash, which could save us lots of object creation, and GC cost would be lower once MTQ gets under load. These MTQs can create a very large number of objects, and this seems to be a hot spot. I currently have use cases for direct support of something like a ParallelArray base class in LUCENE-2186, and it seems we can use it here too.
          • In FloatsUtil#nextAfter I wonder if we need the following lines:
            return new Float(direction)
            ...
            return Double.valueOf(direction).floatValue();
            

            since those methods do nothing else than a plain (float) direction cast, really.
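          Simon's last point can be checked with a tiny sketch. The method names below are illustrative (not from the actual patch); only the two quoted expressions and the `direction` variable come from the snippet above. Both boxed forms produce the same float as a plain cast.

```java
// Sketch verifying the observation about FloatsUtil#nextAfter: the
// boxed expressions are equivalent to a plain (float) cast.
class NextAfterBoxingSketch {
    static float boxedFloat(double direction) {
        return new Float(direction);                    // boxes, then auto-unboxes
    }
    static float boxedDouble(double direction) {
        return Double.valueOf(direction).floatValue();  // boxes, then narrows
    }
    static float plainCast(double direction) {
        return (float) direction;                       // same result, no allocation
    }
}
```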

          Michael McCandless added a comment -

          I attached a hacked patch... nowhere near committable, various tests
          fail, etc... yet I think once we clean it up, the approach is viable.

          I started from the patch like 2 iterations ago, and then fixed how the
          MTQ BQ rewrite works so that instead of the two passes (first to
          gather matching terms, second to create weight/scorers & run the BQ),
          it now makes a single pass.

          In that single pass it records which terms matched which segments, and
          creates TermScorer for each.

          After the single pass, once we've summed up the top level docFreq for
          all terms, I go back and reset the weights for all the TermScorers,
          sumSQ them, normalize, etc., and then create a FakeQuery object whose
          only purpose is to remember the per-segment scorers and provide them
          once .scorer(...) is called on each segment.

          The big gain with this approach is you don't waste effort trying to
          seek to non-existent terms in the sub readers. Normally the terms
          cache would save you here, but, we never cache a miss and so when we
          try to look that up again it's always a real (costly) seek.

          With this approach we can disable using the terms cache entirely from
          MTQ.rewrite, which is great.

          I believe the patch works correctly, at least for this test, because
          on my 10M wikipedia index it gets identical top N results as clean
          trunk. Here're the perf gains:

          Query    QPS clean  QPS mtqseg  Pct diff
          state        37.49       37.40     -0.2%
          unit*        11.86       20.23     70.5%
          un*d         13.58       30.85    127.2%
          uni*ed      173.22      535.27    209.0%
          u*d           2.61        9.05    247.3%
          un*ed        33.59      120.32    258.1%

          Note that these gains already include the sizable gains from the
          original patch, but the single pass approach makes further great
          gains, especially eg on the prefix query.

          I don't think we should couple this new patch w/ this issue... this
          issue already has awesome gains with a fairly minor change...
          I'll open a new issue.
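          The bookkeeping of that single pass can be sketched in miniature. This is only an illustration of summing top-level docFreq across segments in one pass; the TermScorer/FakeQuery plumbing from the patch is omitted, and segments are modeled as plain maps.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Miniature of the single-pass bookkeeping: each segment is visited
// once, and per-term docFreq is summed into a top-level total as we go.
class SinglePassSketch {
    static Map<String, Integer> totalDocFreq(List<Map<String, Integer>> segments) {
        Map<String, Integer> total = new HashMap<>();
        for (Map<String, Integer> segment : segments) {
            // only terms that actually exist in this segment are touched,
            // so we never pay for seeks to non-existent terms
            for (Map.Entry<String, Integer> e : segment.entrySet()) {
                total.merge(e.getKey(), e.getValue(), Integer::sum);
            }
        }
        return total;
    }
}
```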

          Michael McCandless added a comment -

          OK I opened LUCENE-2694.

          Robert Muir added a comment -

          The big gain with this approach is you don't waste effort trying to
          seek to non-existent terms in the sub readers. Normally the terms
          cache would save you here, but, we never cache a miss and so when we
          try to look that up again it's always a real (costly) seek.

          With this approach we can disable using the terms cache entirely from
          MTQ.rewrite, which is great.

          This is the way to go, because it's horrible for the MTQ to touch the terms cache at all,
          and depending on it for good performance is even worse.

          I think if you somehow changed the benchmark to use multiple threads and had different
          queries executing at the same time, you would see these guys fighting each other
          over huge numbers of terms with df=1 and slowing each other down... but we wouldn't
          have this problem with them rewriting to FakeQuery.

          Robert Muir added a comment -

          In FloatsUtil#nextAfter I wonder if we need the following lines:

          Simon, this is a good point. I poached this method from Harmony's StrictMath.nextAfter.
          It's interesting to take a look also at their Math.nextAfter.

          The difference is this Double promotion; I don't understand if this affects us at all or what it would change.
          In both cases I do not understand why the boxing is necessary!

          Uwe Schindler added a comment -

          Simon:

          Sice we do not reuse the allocator we don't need to use the synced one here. There is no reset call anywhere to free the allocated blocks too. We should just use new BytesRefHash() here.

          There is no such ctor in trunk. The only available allocator is the used one.

          Simon Willnauer added a comment -

          There is no such ctor in trunk. The only available allocator is the used one.

          Good point - there is one in mine. I'm going to upload a patch for this later / tomorrow...

          Simon Willnauer added a comment -

          The current patch makes use of a DirectAllocator without recycling etc. I removed the unnecessary boxing in FloatsUtil and replaced the terms HashMap with a BytesRefHash. I skipped the latest patch since Mike marked it as a hack and opened a new issue for that. This one is based on Uwe's latest one. All tests pass for me, though.

          Uwe Schindler added a comment -

          Thanks for the improvements, some comments and changes I did locally:

          • The code in BooleanQueryRewrite uses += for the boost and docFreq in the case of (>=0, no entry in BytesRefHash), but this should only be an assignment. The update and comparison in the assert should be done only when an entry is already in the hash. Boosts should never be summed up.
          • The parts to update with LUCENE-2702 are marked; they currently wrap with new BytesRef(#get) and should be replaced with code like it was before, using PagedBytes.
          • Creating the BytesStartArray is a lot of work; maybe we can un-final DirectBytesStartArray and reuse the code. This would make it easier to extend it and simply add more parallel arrays. Client code should not need to replicate the code (this is maybe another issue).
          • But there is also a problem with the current code in TermFreqBoostByteStart: the arrays may not use the exact same size as expected (depending on how oversize/grow works). As they are parallel arrays, all should be equal size, so we should use grow/oversize only for the base array and resize the others to the same size. Do we have an ArrayUtil method for that? Currently it may be broken. Any comments?
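          The parallel-array sizing fix in the last bullet can be sketched like this: oversize only the base array, then copy every companion array to exactly the same length, keeping all parallel arrays in lockstep. The growth formula below is a simplified stand-in for Lucene's ArrayUtil.oversize, and the field names are illustrative.

```java
import java.util.Arrays;

// Sketch of the sizing fix: grow the base array once, then resize the
// companions to exactly the same length.
class ParallelArrayGrowSketch {
    static float[] boosts = new float[8];   // base array
    static int[] docFreqs = new int[8];     // parallel companion

    static int oversize(int minSize) {
        return minSize + (minSize >> 3) + 8; // simplified growth policy
    }

    static void grow(int minSize) {
        if (minSize > boosts.length) {
            int newSize = oversize(minSize);             // grow the base once...
            boosts = Arrays.copyOf(boosts, newSize);
            docFreqs = Arrays.copyOf(docFreqs, newSize); // ...companions to the exact same size
        }
    }
}
```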
          Uwe Schindler added a comment -

          Here a patch with the allocation problems resolved. Also the DirectBytesStartArray is public.

          Simon Willnauer added a comment -

          The code in BooleanQueryRewrite uses += for the boost and docFreq in the case of (>=0, no entry in BytesRefHash), but this should only be an assignment. The update and comparison in the assert should be done only when an entry is already in the hash. Boosts should never be sumed up.

          Ah yeah - true, for sure! It did not break, since that only happens once, when the entry is initially added. But you are right that this should only be an assignment.

          But there is also a problem with the current code in TermFreqBoostByteStart: The arrays may not use the exact same size as expected (depending how oversize/grow works). As they are parallel arrays, all should be equal size, so we should only use grow/oversize only for the base array and resize the others to same size. Do we have an ArrayUtil method for that? Currently it (may) be broken. Any comments?

          Good catch, man! This won't happen here, but it's cleaner to use the exact same size. The bigger problem is that I forgot to add the right constant to the grow method, though. I can fix that in a minute.

          Simon Willnauer added a comment -

          Updated patch after committing LUCENE-2707

          I also fixed the constant in the grow(float[]) method.

          Uwe Schindler added a comment -

          Merged patch. I added some additional asserts and fixed some spelling problems in comments. I will now remove the attributes hell.

          So this patch is just for review, before the big changes come

          Uwe Schindler added a comment - edited

          This is the attributes-hell patch (not yet finalized on the FuzzyTermsEnum side - Robert, can you review?).

          The change is:

          • BoostAttribute is only added to the TermsEnum, because the TermsEnum produces the boost and the MTQ rewrite consumes it.
          • MaxNonCompetitiveBoostAttribute is owned by the rewrite mode, as it is the producer. The TermsEnum consumes this attribute.

          Fixing this needs the hackish attributes() method in the fuzzy rewrite.

          TODO: Contrib/Solr is not yet reviewed for the API change in MTQ.getTermsEnum()!

          Uwe Schindler added a comment -

          Small improvements. Also added the missing bottomChanged() call to the Fuzzy ctor.

          Robert Muir added a comment -

          I will play with the latest patch some, and hopefully upload a new one.

          The real cause of this "tie-break" case is the fact that the priority queue comparison is "compare by boost, then term text".

          With the MultiTermsEnum this was no problem, because we look at all terms in order, so we made MaxNonCompetitiveBoostAttribute just a float.

          With per-segment rewrite, we can look at terms out of order.

          So I think if we add the optional term text of the pq's bottom from the previous segment to the MaxNonCompetitiveBoostAttribute itself, then the enum itself can implement the tie break, cleaner and more efficiently. The rewrite method or its consumer should only be setting the values of this attribute, not dealing with this case.
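          The "compare by boost, then term text" rule can be sketched as a comparator over a hypothetical ScoreTerm pair. The tie direction shown (on equal boost, the lexicographically larger term counts as less competitive) is an illustrative assumption, not necessarily what the real queue does.

```java
import java.util.Comparator;

// Sketch of the priority-queue ordering: primary key is the boost,
// secondary key (tie-break) is the term text. ScoreTerm is a stand-in
// class, not the actual Lucene type.
class ScoreTermSketch {
    static final class ScoreTerm {
        final String term;
        final float boost;
        ScoreTerm(String term, float boost) { this.term = term; this.boost = boost; }
    }

    // orders entries least-competitive first (queue bottom first)
    static final Comparator<ScoreTerm> BOTTOM_FIRST = (a, b) -> {
        int cmp = Float.compare(a.boost, b.boost); // primary: lower boost is worse
        if (cmp != 0) return cmp;
        return b.term.compareTo(a.term);           // assumed tie-break on term text
    };
}
```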

          Uwe Schindler added a comment -

          Here is the patch for Robert, which fails the tie-break test. I think you can fix the tie-break case using the competitiveTerm and we are done.

          Robert Muir added a comment -

          Here is my patch, but the test still fails... I think it is a bug in the rewrite method's priority queue.

          It has nothing to do with maxBoostAttribute, because I can add an "if (true) return;" to FuzzyTermsEnum.bottomChanged() and the test will still always fail.

          Uwe Schindler added a comment -

          Here is the final patch. There is only one special case:
          our Boolean clause sorting only works for TermQuery, but the BoostOnly fuzzy rewrite creates ConstantScoreQueries for each clause, so there is no reordering.

          You can see this in TestMultiTermQueryRewrites with tests.verbose=true.

          Uwe Schindler added a comment -

          This patch contains a better BooleanClause comparator that also reorders ConstantScoreQueries that contain a TermQuery.

          Maybe the ideal solution would be for every query to get a method that returns the "primary" term or null. The default would be null, but TermQuery and all delegating wrappers should implement it.
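          The "primary term" idea can be sketched with stand-in classes (these are not the real Lucene query classes): the default returns null, a term query returns its term, and a delegating wrapper simply forwards.

```java
// Sketch of a "primary term" accessor with hypothetical stand-in
// classes: the clause comparator could sort on primaryTerm() and
// ignore queries that return null.
class PrimaryTermSketch {
    interface Query {
        default String primaryTerm() { return null; } // default: no primary term
    }
    static final class TermQuery implements Query {
        final String term;
        TermQuery(String term) { this.term = term; }
        public String primaryTerm() { return term; }
    }
    static final class ConstantScoreQuery implements Query {
        final Query wrapped;
        ConstantScoreQuery(Query wrapped) { this.wrapped = wrapped; }
        public String primaryTerm() { return wrapped.primaryTerm(); } // delegate
    }
}
```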

          Simon Willnauer added a comment - edited

          Just as a first result, here are the numbers I see on my workstation with a 10 M Wikipedia index (5 segments):

          Query        QPS trunk  QPS LUCENE-2690  Pct diff
          unit state        3.74             3.81      1.8%
          united~0.6       10.07            10.26      1.9%
          unit*            11.89            12.65      6.5%
          united~0.7       39.29            45.52     15.9%
          un*d             15.17            27.86     83.7%

          using the latest patch.

          Those are run with -Xmx2G on an Intel Core 2 at 3 GHz.

          Uwe Schindler added a comment -

          Simon: Just for comparing with Mike's results: How many segments?

          Uwe Schindler added a comment -

          Updated patch with an optimization in the ctor of FuzzyTermsEnum.

          Uwe Schindler added a comment -

          Revision of last patch (was buggy).

          About the "chicken and egg problem": Maybe AutomatonTermsEnum should throw an exception if the termComparator is not the expected one. This would prevent people from trying automata with other indexes.
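The fail-fast guard suggested above could look like this minimal sketch. All names here are hypothetical (this is not actual Lucene code); the idea is simply to reject a terms dictionary whose sort order differs from the byte order the automaton enumeration assumes, instead of silently enumerating in an order it cannot follow.

```java
import java.util.Comparator;

public class ComparatorGuard {
  // True if the index's term comparator matches the expected sort order.
  static <T> boolean compatible(Comparator<T> expected, Comparator<T> actual) {
    return expected.equals(actual);
  }

  // Fail fast with an exception, as suggested in the comment above.
  static <T> void checkComparator(Comparator<T> expected, Comparator<T> actual) {
    if (!compatible(expected, actual)) {
      throw new UnsupportedOperationException(
          "this TermsEnum requires terms sorted with: " + expected);
    }
  }
}
```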

          Yonik Seeley added a comment -

          Hmmm, it looks like this changes BQ.rewrite() to always rewrite/clone? Do we need that extra overhead?

          Uwe Schindler added a comment -

          Hmmm, it looks like this changes BQ.rewrite() to always rewrite/clone? Do we need that extra overhead?

          ...until we find a better solution for how we reorder clauses. There is a big speed degradation for queries with lots of MTQs if we don't reorder the clauses intelligently.

          Yonik Seeley added a comment -

          There is a big speed degradation for queries with lots of MTQs if we don't reorder the clauses intelligently.

          Seems like the right place for sorting is in the MTQ rewrite to a BQ.
          The current patch makes BQ rewrite quite a bit more expensive... a clone is always made, and equals is always called on the clone after.

          For normal boolean queries (caused by someone typing in a few words), it seems like a real-world speedup is unlikely (since the terms would need to be in the same tii block). People generating very large boolean queries should also be able to pre-sort them and not have the overhead imposed every time.

          Uwe Schindler added a comment -

          Yes, Mr. No-inlining-Policeman

          We are still working on this patch; it's marked as TODO, so we will investigate further. For random queries it had a huge positive impact on query performance. The BQ cloning/reordering was not measurable. We did this after you left Mike's house, so it was just a quick idea.

          Yonik Seeley added a comment -

          For random queries it had a huge positive impact on query performance.

          If the clauses were just term queries, that would make me really suspect the test.
          If it was MTQ queries, then MTQ should sort, not BQ.

          The BQ cloning/reordering was not measurable.

          Right - I would expect that for typical queries and typical uses.
          I guess I'm worried about the atypical cases since I've seen so many of them - people putting together single boolean queries with 10K clauses, people doing complex nested queries with thousands of terms, or people executing thousands of queries per request (or per document added, via memory index) where this overhead suddenly becomes significant.

          We are still working on this patch; it's marked as TODO, so we will investigate further.

          Cool

          Uwe Schindler added a comment -

          Attached is a new patch with two changes:

          • moved the BQ reordering to MTQ for now. A general reordering of BooleanQueries should be done in a separate issue (with a more performant rewrite). Currently this uses the same comparator as BQ did before. You may wonder: why not simply use a sorted map? The idea is that sorting once at the end is faster than using a TreeMap, where every term is compared on insert (even those that later fall out of the queue). I sort the BQ clauses directly, like BQ did, to avoid creating an additional array holding all terms again. Maybe it's still faster to copy all BytesRefs to an array first and then build the BQ? For now this should be enough. To improve it we need SorterTemplate again (for the BytesRefHash case).
          • fixed an issue with the PQ in TopTermsRewrite: the bottom information was only set when the PQ was overflowing. In the past it was set as soon as the queue was full, and now it is again. This was an optimization bug; it now behaves as it always did. Maybe this explains Mike's score changes on the Wikipedia index?

          Mike: can you test?
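The trade-off in the first bullet above (sort once at the end versus a TreeMap that sorts on every insert) can be sketched like this. This is an illustrative sketch with hypothetical names, using plain strings instead of BytesRefs; the point is that terms discarded later never pay for comparisons during collection.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.TreeSet;

public class SortAtEndSketch {
  // Strategy kept in the patch: collect terms unsorted, then sort once.
  static List<String> collectThenSort(List<String> terms) {
    List<String> out = new ArrayList<>(terms); // O(1) per collected term
    Collections.sort(out);                     // one O(n log n) sort at the end
    return out;
  }

  // Rejected alternative: keep the collection sorted the whole time.
  // Every insert costs O(log n) comparisons, even for terms discarded later.
  // (A TreeSet also drops duplicates; the patch uses hashed sets for that.)
  static List<String> collectSorted(List<String> terms) {
    return new ArrayList<>(new TreeSet<>(terms));
  }
}
```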

          Uwe Schindler added a comment -

          Patch with BytesRefHash parallel array sorting instead of sorting the BQ. This should improve all cases. This patch also contains a test that this resorting works.

          It also has an assert that the docFreq is correct. This only slows down tests, but is more secure!

          Now we only need to fix contrib, and then Mike can check the performance (Mike: you have to update your current trunk checkout, too, so the scores will compare correctly).

          Michael McCandless added a comment -

          Test results on 10M Wiki index:

          Single seg:

          Query QPS clean QPS mtqseg3 Pct diff
          united~0.6 26.01 25.48 -2.0%
          un*ed 260.88 258.61 -0.9%
          un*d 91.52 90.99 -0.6%
          united~0.7 98.01 97.99 -0.0%
          state 39.95 39.94 -0.0%
          unit* 33.60 33.73 0.4%
          u*d 29.87 30.01 0.5%
          uni*ed 1825.14 1859.49 1.9%

          Multi seg (22 segments):

          Query QPS clean QPS mtqseg3 Pct diff
          unit* 34.68 34.56 -0.3%
          state 40.43 40.30 -0.3%
          united~0.6 3.18 3.20 0.6%
          u*d 16.81 19.55 16.3%
          united~0.7 11.01 13.85 25.8%
          un*d 52.51 66.21 26.1%
          un*ed 42.88 92.95 116.8%
          uni*ed 175.06 543.64 210.5%

          And, the test did not barf so the hits (docID & scores) are identical!

          Uwe Schindler added a comment -

          Patch:

          • contrib changed and tests pass (also fixed a bug in the MemoryIndex TermsEnum)
          • improved test in core for duplicate terms, boosts, sorting
          Uwe Schindler added a comment -

          Final patch, will commit this soon:

          • added javadocs, changes and migration instructions
          Uwe Schindler added a comment -

          Committed revision: 1022934


            People

            • Assignee: Uwe Schindler
            • Reporter: Uwe Schindler
            • Votes: 0
            • Watchers: 0
