[LUCENE-1536] if a filter can support random access API, we should use it - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 2.4
Fix Version/s: 4.0-ALPHA
Component/s: core/search
Labels:

Lucene Fields:

New

Description

I ran some performance tests, comparing applying a filter via
random-access API instead of current trunk's iterator API.

This was inspired by ~~LUCENE-1476~~, where we realized deletions should
really be implemented just like a filter, but then in testing found
that switching deletions to iterator was a very sizable performance
hit.

Some notes on the test:

Index is first 2M docs of Wikipedia. Test machine is Mac OS X
10.5.6, quad core Intel CPU, 6 GB RAM, java 1.6.0_07-b06-153.

I test across multiple queries. 1-X means an OR query, eg 1-4
means 1 OR 2 OR 3 OR 4, whereas +1-4 is an AND query, ie 1 AND 2
AND 3 AND 4. "u s" means "united states" (phrase search).

I test with multiple filter densities (0, 1, 2, 5, 10, 25, 75, 90,
95, 98, 99, 99.99999 (filter is non-null but all bits are set),
100 (filter=null, control)).

Method high means I use random-access filter API in
IndexSearcher's main loop. Method low means I use random-access
filter API down in SegmentTermDocs (just like deleted docs
today).

Baseline (QPS) is current trunk, where filter is applied as iterator up
"high" (ie in IndexSearcher's search loop).

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

CachedFilterIndexReader.java
13/Sep/10 18:44
4 kB
Michael McCandless
changes-yonik-uwe.patch
08/Oct/11 09:31
3 kB
Uwe Schindler
LUCENE-1536_hack.patch
10/Oct/11 17:02
150 kB
Robert Muir
LUCENE-1536.patch
24/Oct/11 12:59
52 kB
Uwe Schindler
LUCENE-1536.patch
13/Oct/11 10:15
148 kB
Uwe Schindler
LUCENE-1536.patch
11/Oct/11 11:43
148 kB
Uwe Schindler
LUCENE-1536.patch
08/Oct/11 19:04
148 kB
Uwe Schindler
LUCENE-1536.patch
08/Oct/11 15:51
147 kB
Uwe Schindler
LUCENE-1536.patch
08/Oct/11 14:25
147 kB
Robert Muir
LUCENE-1536.patch
08/Oct/11 09:31
141 kB
Uwe Schindler
LUCENE-1536.patch
07/Oct/11 22:54
141 kB
Yonik Seeley
LUCENE-1536.patch
07/Oct/11 13:04
123 kB
Robert Muir
LUCENE-1536.patch
06/Oct/11 12:33
52 kB
Robert Muir
LUCENE-1536.patch
05/Oct/11 20:37
29 kB
Uwe Schindler
LUCENE-1536.patch
05/Oct/11 18:34
28 kB
Uwe Schindler
LUCENE-1536.patch
05/Oct/11 18:17
28 kB
Uwe Schindler
LUCENE-1536.patch
05/Oct/11 02:17
24 kB
Chris Male
LUCENE-1536.patch
04/Oct/11 19:55
24 kB
Robert Muir
LUCENE-1536.patch
04/Oct/11 19:03
20 kB
Robert Muir
LUCENE-1536.patch
02/Oct/11 12:25
24 kB
Chris Male
LUCENE-1536.patch
28/Sep/11 12:41
75 kB
Michael McCandless
LUCENE-1536.patch
27/Sep/11 18:30
71 kB
Michael McCandless
LUCENE-1536.patch
27/Sep/11 10:51
68 kB
Chris Male
LUCENE-1536.patch
23/Sep/11 05:33
67 kB
Chris Male
LUCENE-1536.patch
22/Sep/11 14:21
69 kB
Chris Male
LUCENE-1536.patch
09/Jul/11 14:53
63 kB
Michael McCandless
LUCENE-1536.patch
26/Jun/11 18:27
199 kB
Michael McCandless
LUCENE-1536.patch
16/Sep/09 06:25
55 kB
Jason Rutherglen
LUCENE-1536.patch
15/Sep/09 22:14
51 kB
Jason Rutherglen
LUCENE-1536.patch
21/Apr/09 17:45
26 kB
Jason Rutherglen
LUCENE-1536.patch
21/Apr/09 02:48
19 kB
Jason Rutherglen
LUCENE-1536.patch
04/Feb/09 20:34
11 kB
Michael McCandless
LUCENE-1536-rewrite.patch
08/Oct/11 09:31
0.1 kB
Uwe Schindler
LUCENE-1536-rewrite.patch
07/Oct/11 15:25
137 kB
Uwe Schindler
LUCENE-1536-rewrite.patch
07/Oct/11 14:07
128 kB
Uwe Schindler
LUCENE-1536-rewrite.patch
07/Oct/11 11:05
115 kB
Uwe Schindler
LUCENE-1536-rewrite.patch
06/Oct/11 23:12
88 kB
Uwe Schindler
LUCENE-1536-rewrite.patch
06/Oct/11 22:04
84 kB
Uwe Schindler
LUCENE-1536-rewrite.patch
06/Oct/11 14:26
71 kB
Uwe Schindler
LUCENE-1536-rewrite.patch
06/Oct/11 12:13
52 kB
Uwe Schindler
luceneutil.patch
07/Oct/11 19:24
3 kB
Robert Muir

Issue Links

breaks

SOLR-3062 Solr4 Join query with fq not correctly filtering results

Closed

is blocked by

LUCENE-3503 DisjunctionSumScorer gives slightly (float iotas) different scores when you .nextDoc vs .advance

Closed

is related to

LUCENE-4548 BooleanFilter should optionally pass down further restricted acceptDocs in the MUST case (and acceptDocs in general)

Resolved

relates to

LUCENE-3212 Supply FilterIndexReader based on any o.a.l.search.Filter

Closed

Activity

People

Assignee:: Uwe Schindler

Reporter:: Michael McCandless

Votes:: 2 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 04/Feb/09 20:29

Updated:: 28/Aug/22 11:57

Resolved:: 25/Oct/11 12:09