Lucene - Core
LUCENE-6645

BKD tree queries should use BitDocIdSet.Builder

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 5.3, 6.0
    • Component/s: None
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      When I was iterating on BKD tree originally I remember trying to use this builder (which makes a sparse bit set at first and then upgrades to dense if enough bits get set) and being disappointed with its performance.

      I wound up just making a FixedBitSet every time, but this is obviously wasteful for small queries.

      It could be that the perf was poor because I was always .or'ing in DISIs that had 512-1024 hits each time (the size of each leaf cell in the BKD tree)? I also had to make my own DISI wrapper around each leaf cell... maybe that was the source of the slowness, not sure.

      I also sort of wondered whether the SmallDocSet in the spatial module (backed by a SentinelIntSet) might be faster ... though it'd need to be sorted in the end, after building, before returning to Lucene.
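
      For illustration, the sparse-then-dense idea boils down to something like this (a conceptual sketch with plain JDK types and an assumed threshold, not Lucene's BitDocIdSet.Builder):

      import java.util.BitSet;
      import java.util.HashSet;
      import java.util.Set;

      // Conceptual sketch only: collect hits in a cheap sparse structure first,
      // and switch to a dense bit set once enough docs have been collected.
      final class SparseThenDenseCollector {
        private final int maxDoc;
        private final int threshold;               // illustrative cut-over point
        private Set<Integer> sparse = new HashSet<>();
        private BitSet dense;                      // null until upgraded

        SparseThenDenseCollector(int maxDoc) {
          this.maxDoc = maxDoc;
          this.threshold = Math.max(1, maxDoc / 128);
        }

        void add(int doc) {
          if (dense != null) {
            dense.set(doc);
            return;
          }
          sparse.add(doc);
          if (sparse.size() >= threshold) {        // "enough bits set": go dense
            dense = new BitSet(maxDoc);
            for (int d : sparse) {
              dense.set(d);
            }
            sparse = null;
          }
        }
      }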

      Attachments

      1. LUCENE-6645.patch
        4 kB
        Michael McCandless
      2. LUCENE-6645.patch
        3 kB
        Michael McCandless
      3. LUCENE-6645.patch
        59 kB
        Adrien Grand
      4. LUCENE-6645.patch
        51 kB
        Adrien Grand
      5. LUCENE-6645.patch
        25 kB
        Adrien Grand
      6. LUCENE-6645.patch
        13 kB
        Michael McCandless
      7. LUCENE-6645-spatial.patch
        10 kB
        Adrien Grand

        Activity

        David Smiley added a comment -

        FYI I originally used FixedBitSet in the RPT based spatial filters (technically OpenBitSet originally) because at that time there was no Sparse impl with a builder. That's also why SmallDocSet exists. I'm curious to see what you find here regarding the performance; I'd like to use the bitset builder and remove SmallDocSet.

        Michael McCandless added a comment -

        Here's a patch, just restoring what I had in earlier iterations on the original BKD tree issue (LUCENE-6477) ... maybe I am doing something silly?

        The BKD test passes ... I'll compare performance vs current trunk (FixedBitSet every time).

        I had to make little reusable DISI classes to pass to the BitDocIdSet.Builder.or method.

        It could be that if we made the BKD tree wasteful by indexing prefix terms, so that the number of DISIs we need to or together is small, the perf hit wouldn't be so large ...
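
        For illustration, such a reusable DISI wrapper over one leaf cell's sorted docIDs could look roughly like this (a sketch, not the patch's actual class):

        import java.io.IOException;
        import org.apache.lucene.search.DocIdSetIterator;

        // Illustrative wrapper: iterates a sorted int[] of docIDs (e.g. one BKD leaf cell).
        final class SortedIntArrayDISI extends DocIdSetIterator {
          private final int[] docs;   // sorted, distinct docIDs
          private final int length;   // number of valid entries in docs
          private int i = -1;         // -1 means iteration has not started

          SortedIntArrayDISI(int[] docs, int length) {
            this.docs = docs;
            this.length = length;
          }

          @Override
          public int docID() {
            if (i < 0) {
              return -1;
            }
            return i < length ? docs[i] : NO_MORE_DOCS;
          }

          @Override
          public int nextDoc() throws IOException {
            i++;
            return docID();
          }

          @Override
          public int advance(int target) throws IOException {
            // a linear scan is enough for tiny cells (512-1024 docs)
            do {
              i++;
            } while (i < length && docs[i] < target);
            return docID();
          }

          @Override
          public long cost() {
            return length;
          }
        }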

        Michael McCandless added a comment -

        The lat/lons to index are here:

        http://people.apache.org/~mikemccand/latlon.subsetPlusAllLondon.txt.lzma

        It uncompresses to ~1.9 GB.

        Then run IndexAndSearchOpenStreetMaps.java in luceneutil/src/main/perf. (You have to edit the hard-coded path to this lat/lons input file.)

        Run it first with the createIndex call uncommented, then comment it out (you can just re-use that index to test searching).

        When I run this on trunk I get 1.54 sec for 225 "bboxes around London", and with the patch 3.89 seconds, or ~2.5X slower ...

        Adrien Grand added a comment -

        I played a bit with the benchmark and have similar results (1.76 sec for trunk and more than 4 sec with the patch). It's a worst case for BitDocIdSetBuilder given that it always starts to build a SparseFixedBitSet to eventually upgrade it to a FixedBitSet. But still it's disappointing that it's so slow compared to building a FixedBitSet directly.

        I've experimented with a more brute-force approach (see attached patch) that uses a plain int[] instead of a SparseFixedBitSet for the sparse case, and it seems to perform better: the benchmark runs in 1.76 sec on trunk and 2.70 sec with the patch if the builder is configured to use an int[] for up to maxDoc / 128 docs. It goes down to 1.96 sec with a threshold of maxDoc / 2048. Maybe this is what we should use instead of BitDocIdSetBuilder?
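
        For illustration, the build step for the sparse int[] case could look roughly like this (a sketch only; the attached patch uses a radix sort and Lucene's DocIdSet types):

        import java.util.Arrays;

        // Illustrative only: sort the raw int[] buffer once when building the final
        // doc ID set, dropping any duplicates, instead of paying per-add costs.
        final class DocIdBufferUtil {
          static int[] buildSortedDocIds(int[] buffer, int count) {
            Arrays.sort(buffer, 0, count);          // the patch uses a radix sort here
            int unique = 0;
            for (int i = 0; i < count; i++) {
              if (unique == 0 || buffer[i] != buffer[unique - 1]) {
                buffer[unique++] = buffer[i];
              }
            }
            return Arrays.copyOf(buffer, unique);
          }
        }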

        I tried to see how this affects our luceneutil benchmark and there is barely any change:

                            TaskQPS baseline      StdDev   QPS patch      StdDev                Pct diff
                          Fuzzy1       74.41     (18.3%)       69.59     (19.4%)   -6.5% ( -37% -   38%)
                         LowTerm      761.39      (2.4%)      749.20      (3.6%)   -1.6% (  -7% -    4%)
                    OrNotHighLow      877.81      (2.2%)      867.60      (5.3%)   -1.2% (  -8% -    6%)
                    OrHighNotMed       76.63      (2.1%)       75.89      (2.7%)   -1.0% (  -5% -    3%)
                         MedTerm      309.75      (1.3%)      306.86      (2.6%)   -0.9% (  -4% -    2%)
                      OrHighHigh       26.86      (5.4%)       26.64      (3.3%)   -0.8% (  -9% -    8%)
                   OrNotHighHigh       67.94      (1.0%)       67.42      (2.0%)   -0.8% (  -3% -    2%)
                        HighTerm      132.28      (1.4%)      131.29      (1.7%)   -0.7% (  -3% -    2%)
                         Respell       78.71      (2.8%)       78.14      (3.2%)   -0.7% (  -6% -    5%)
                       LowPhrase      121.23      (0.8%)      120.47      (1.3%)   -0.6% (  -2% -    1%)
                    OrHighNotLow      112.94      (2.3%)      112.25      (2.5%)   -0.6% (  -5% -    4%)
                    OrNotHighMed      223.81      (2.4%)      222.52      (3.8%)   -0.6% (  -6% -    5%)
                       OrHighLow       71.79      (4.3%)       71.39      (3.3%)   -0.6% (  -7% -    7%)
                     MedSpanNear       23.33      (1.1%)       23.21      (1.8%)   -0.5% (  -3% -    2%)
                     AndHighHigh       62.01      (1.9%)       61.71      (3.6%)   -0.5% (  -5% -    5%)
                       OrHighMed       41.79      (5.5%)       41.61      (3.6%)   -0.4% (  -9% -    9%)
                      AndHighMed       90.86      (2.0%)       90.61      (2.8%)   -0.3% (  -5% -    4%)
                HighSloppyPhrase       47.43      (4.6%)       47.33      (4.8%)   -0.2% (  -9% -    9%)
                      HighPhrase       28.36      (1.6%)       28.30      (1.3%)   -0.2% (  -3% -    2%)
                       MedPhrase      147.25      (1.4%)      146.99      (1.6%)   -0.2% (  -3% -    2%)
                 LowSloppyPhrase       37.07      (2.2%)       37.03      (2.3%)   -0.1% (  -4% -    4%)
                 MedSloppyPhrase      156.95      (3.7%)      156.80      (3.6%)   -0.1% (  -7% -    7%)
                     LowSpanNear       29.05      (2.2%)       29.02      (2.0%)   -0.1% (  -4% -    4%)
                   OrHighNotHigh       61.13      (1.5%)       61.08      (1.6%)   -0.1% (  -3% -    3%)
                    HighSpanNear       15.36      (1.7%)       15.36      (1.8%)    0.0% (  -3% -    3%)
                        Wildcard      111.57      (3.1%)      113.05      (2.1%)    1.3% (  -3% -    6%)
                          IntNRQ        7.49      (7.3%)        7.60      (5.2%)    1.4% ( -10% -   14%)
                         Prefix3       72.81      (4.6%)       74.18      (4.1%)    1.9% (  -6% -   11%)
                      AndHighLow      974.36      (3.0%)      994.46      (2.9%)    2.1% (  -3% -    8%)
                          Fuzzy2       47.42     (16.1%)       53.71     (16.5%)   13.3% ( -16% -   54%)
        

        I suspect this is because our multi-term queries in this benchmark match some high-frequency terms so the upgrade to a FixedBitSet happens quickly anyway.

        Michael McCandless added a comment -

        Thanks Adrien Grand, this is impressive! A radix sorter, an Adder abstraction to skip hot-loop if statements... looks like you had fun with hotspot.

        Did the Adder make a substantial improvement over the more straightforward if-per-add? Maybe we could just add a .grow which would pre-grow the int[] if necessary ... not sure.

        Most of the changes are in DocIdSetBuilder, and less so in BKDTreeReader. You might get a bit more speedup if instead of prepareAdd(1), you did the prepareAdd up front outside the loop with the worst-case count (number of docs in the leaf block)? I.e. this would reserve for the worst case (all docs pass the filter). These leaf blocks are smallish by default (up to 1024 docs), and worst case is this upgrades to a bitset a bit early? Not sure it will help that much, since most of the cells are visited via addAll...

        In each cell of the BKD tree the docIDs are sorted ... but we don't take advantage of that here (not sure how we would).

        Maybe we should try to contain the added hairiness to BKDTreeReader instead of DocIdSetBuilder if indeed this is the only user of this API that is so strange (or of tons of tiny sorted docID blocks)

        Adrien Grand added a comment -

        Thanks for having a look!

        Not sure it will help that much, since most of the cells are visited via addAll...

        Indeed I did not try to optimize here since my profiler did not see it as a hotspot.

        Maybe we should try to contain the added hairiness to BKDTreeReader instead of DocIdSetBuilder if indeed this is the only user of this API that is so strange (or of tons of tiny sorted docID blocks)

        I agree the API is a bit crazy now. It was a way to avoid checking the array length on every call to add(). I'll see how much it costs to remove this 'Adder' class in favour of a grow() method. However, adding tons of tiny sorted doc ID blocks is something that MultiTermQueries can do all the time too, so I'm quite happy that your benchmark revealed this inefficiency.

        Adrien Grand added a comment -

        Here is a patch with less API craziness that should still provide similar performance. It removes the Adder intermediate and uses a grow() method instead, as suggested.
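
        Roughly, the intended usage pattern looks like this (a sketch under assumed names, not the committed DocIdSetBuilder): callers reserve capacity with grow() once per block of docs, so add() needs no bounds check in the hot loop.

        import java.util.Arrays;

        // Sketch of the grow()/add() contract: grow() reserves space up front,
        // add() just writes (illustrative class, not Lucene's DocIdSetBuilder).
        final class GrowThenAddBuffer {
          private int[] docs = new int[16];
          private int count;

          /** Reserve room for up to numDocs additional doc IDs. */
          void grow(int numDocs) {
            int required = count + numDocs;
            if (required > docs.length) {
              docs = Arrays.copyOf(docs, Math.max(required, 2 * docs.length));
            }
          }

          /** Caller must have reserved enough room via grow() first. */
          void add(int doc) {
            docs[count++] = doc;   // no capacity check in the hot loop
          }
        }

        A BKD leaf visitor could then call grow() once with the leaf's doc count before scanning a cell (the up-front, worst-case reservation suggested above), rather than reserving per document.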

        Michael McCandless added a comment -

        Thanks Adrien Grand!

        The spatial module is angry with the patch... but I was able to run the "2% of OpenStreetMap points" benchmark.

        Today (FixedBitSet) it's 1.56 seconds to run 225 queries, and with the last patch it's 2.35 seconds which is a great improvement over where I started here (3.89 seconds)!

        Adrien Grand added a comment -

        Whoops, I forgot to svn add a file; will upload a fixed patch tomorrow. Thanks for confirming it is faster now!

        Adrien Grand added a comment -

        Here is a fixed patch.

        ASF subversion and git services added a comment -

        Commit 1690026 from Adrien Grand in branch 'dev/trunk'
        [ https://svn.apache.org/r1690026 ]

        LUCENE-6645: Optimized DocIdSet building for the "many small postings lists" case.

        Michael McCandless added a comment -

        Thanks Adrien Grand!

        ASF subversion and git services added a comment -

        Commit 1690041 from Adrien Grand in branch 'dev/branches/branch_5x'
        [ https://svn.apache.org/r1690041 ]

        LUCENE-6645: Optimized DocIdSet building for the "many small postings lists" case.

        Michael McCandless added a comment -

        I think with this small change to the builder, we can save one if check in the add method.

        In the BKD test this gives a good gain ... from 3.08 sec in trunk down to 2.32 sec ...

        Adrien Grand added a comment -

        +1 I'm curious if you know what part makes it so much faster? Is it limiting the maximum array size to threshold at most, or the fact that some ArrayUtil logic is now inlined in the builder?

        Michael McCandless added a comment -

        Slightly improved patch: I moved an assert higher up, and made the calls to growBuffer consistently happen only when < threshold.

        I'm not sure why the gains are so high, it must be hotspot silliness...

        Adrien Grand added a comment -

        +1 thanks for fixing the javadocs

        ASF subversion and git services added a comment -

        Commit 1690175 from Michael McCandless in branch 'dev/trunk'
        [ https://svn.apache.org/r1690175 ]

        LUCENE-6645: optimize DocIdSetBuilder a bit more

        ASF subversion and git services added a comment -

        Commit 1690176 from Michael McCandless in branch 'dev/branches/branch_5x'
        [ https://svn.apache.org/r1690176 ]

        LUCENE-6645: optimize DocIdSetBuilder a bit more

        David Smiley added a comment -

        I took a look at the code out of curiosity and I see that "BitDocIdSetBuilder" is the old implementation and that it's been kept around in the spatial module for the IntersectsRPTVerifyQuery. I think that deserved mention in the commentary here. Can't we remove it, and have IntersectsRPTVerifyQuery use the new DocIdSetBuilder? I suspect this was done because of BitDocIdSetBuilder.isDefinitelyEmpty(); yes? If so can't we add a similar method to DocIdSetBuilder?

        Shouldn't QueryBitSetProducer in the "join" module use RoaringDocIdSet for its cached docIdSets instead of the Fixed/Sparse choice chosen by the new BitSet.of method added in this patch? RoaringDocIdSet is ideal for caches; no?

        Adrien Grand added a comment -

        Sorry David, indeed I should have mentioned it. When I looked at IntersectsRPTVerifyQuery, I saw it was using the produced bits so I thought it actually needs bit sets, but maybe it doesn't and we could just use advance()? Regarding isDefinitelyEmpty, I'm wondering if we could keep the builders initially empty and then instantiate them the first time we need to add data? Then we could use a null check to know whether they have any content at all; would it work?
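
        For illustration, that lazy-initialization idea could look roughly like this (class and method names are made up):

        // Keep the builder null until the first hit arrives; a null check then
        // answers "definitely empty?" without a dedicated isDefinitelyEmpty() method.
        final class LazyDocIdCollector {
          /** Minimal stand-in for a doc-ID builder; illustrative only. */
          static final class DocIdBuffer {
            private int[] docs = new int[8];
            private int count;
            void add(int doc) {
              if (count == docs.length) {
                docs = java.util.Arrays.copyOf(docs, 2 * docs.length);
              }
              docs[count++] = doc;
            }
          }

          private DocIdBuffer buffer;   // null means "no hits collected yet"

          void collect(int doc) {
            if (buffer == null) {
              buffer = new DocIdBuffer();
            }
            buffer.add(doc);
          }

          boolean isEmpty() {
            return buffer == null;
          }
        }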

        Shouldn't QueryBitSetProducer in the "join" module use RoaringDocIdSet for its cached docIdSets instead of the Fixed/Sparse choice chosen by the new BitSet.of method added in this patch? RoaringDocIdSet is ideal for caches; no?

        RoaringDocIdSet is indeed our best option for caching. However, the join module needs random access, and in particular nextSetBit/prevSetBit operations, which we can't provide with RoaringDocIdSet. RoaringDocIdSet could potentially use binary search on blocks that are represented with a short[] for random access, which ought to be fast given that short[] blocks can only contain 4096 documents at most (when there are more docs, we use a bit set), but it was still much slower than random access on SparseFixedBitSet and FixedBitSet, so I preferred not to expose random access which might be a performance trap.
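
        For illustration, random access on such a short[] block boils down to an unsigned binary search (a sketch of the idea only; RoaringDocIdSet does not expose this):

        // One Roaring-style "array" block: the low 16 bits of up to 4096 doc IDs,
        // stored sorted, so membership is a binary search over unsigned shorts.
        final class ShortArrayBlock {
          private final short[] docs;   // sorted low 16 bits of the docs in this block

          ShortArrayBlock(short[] sortedDocs) {
            this.docs = sortedDocs;
          }

          boolean contains(int docInBlock) {   // docInBlock in [0, 65536)
            int lo = 0, hi = docs.length - 1;
            while (lo <= hi) {
              int mid = (lo + hi) >>> 1;
              int midDoc = docs[mid] & 0xFFFF;  // compare as unsigned
              if (midDoc < docInBlock) {
                lo = mid + 1;
              } else if (midDoc > docInBlock) {
                hi = mid - 1;
              } else {
                return true;
              }
            }
            return false;
          }
        }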

        David Smiley added a comment -

        When I looked at IntersectsRPTVerifyQuery, I saw it was using the produced bits so I thought it actually needs bit sets, but maybe it doesn't and we could just use advance()?

        It doesn't need bit sets (random-access). I did a little playing around just now and saw DocIdSetBuilder plugged in easily except for isDefinitelyEmpty(). ...

        Regarding isDefinitelyEmpty, I'm wondering if we could keep the builders initially empty and then instantiate them the first time we need to add data? Then we could use a null check to know whether they have any content at all; would it work?

        We could, but that's a little more error-prone (null check) & more code than simply having an isDefinitelyEmpty() method. In fact it would simply be isEmpty() for DocIdSetBuilder, as it has a definitive answer. Nonetheless, if you feel this method is somehow a bad idea then we can proceed with your suggestion.

        RE RoaringDocIdSet – that's very interesting; thanks for the background. Perhaps a comment in QueryBitSetProducer would clarify why this choice is made.

        Adrien Grand added a comment -

        I gave it a try without adding a method to DocIdSetBuilder and by using iterators of the returned DocIdSets instead of using random-access. David, maybe you could have a look?
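
        For illustration, the iterator-based consumption pattern is just a nextDoc() loop, something like this (a sketch, not the actual IntersectsRPTVerifyQuery change):

        import java.io.IOException;
        import java.util.function.IntConsumer;
        import org.apache.lucene.search.DocIdSet;
        import org.apache.lucene.search.DocIdSetIterator;

        // Walk a DocIdSet sequentially instead of asking it for random-access bits.
        final class DocIdSetWalker {
          static void forEachDoc(DocIdSet set, IntConsumer consumer) throws IOException {
            DocIdSetIterator it = set.iterator();
            if (it == null) {   // a null iterator means the set matches no documents
              return;
            }
            for (int doc = it.nextDoc(); doc != DocIdSetIterator.NO_MORE_DOCS; doc = it.nextDoc()) {
              consumer.accept(doc);
            }
          }
        }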

        David Smiley added a comment -

        Looks great Adrien! I took a careful look at it. +1

        Adrien Grand added a comment -

        Thanks for reviewing David, I'll commit shortly.

        ASF subversion and git services added a comment -

        Commit 1691335 from Adrien Grand in branch 'dev/trunk'
        [ https://svn.apache.org/r1691335 ]

        LUCENE-6645: Remove BitDocIdSetBuilder from lucene-spatial as well.

        ASF subversion and git services added a comment -

        Commit 1691344 from Adrien Grand in branch 'dev/branches/branch_5x'
        [ https://svn.apache.org/r1691344 ]

        LUCENE-6645: Remove BitDocIdSetBuilder from lucene-spatial as well.

        Shalin Shekhar Mangar added a comment -

        Bulk close for 5.3.0 release


          People

          • Assignee:
            Unassigned
          • Reporter:
            Michael McCandless