I have some first "benchmarks" on the three trie implementations. They were done on three indexes with the same content, encoded in 8 bit, 4 bit, and 2 bit variants, containing real-world data from 575,000 documents of the PANGAEA repository (see www.pangaea.de, as mentioned before). The same range queries were executed after some warmup, and time was measured until TopDocs returned the first 10 documents.
The indexes each contain 13 numeric, trie-encoded fields (doubles and Dates). Index size (including the "normal" fields) was:
- 8bit: 4.8 GiB
- 4bit: 5.1 GiB
- 2bit: 5.7 GiB
So the 13*8*575,000 additional trie terms (of longer size) took 300 MB going from 8 bit to 4 bit. Going from 4 bit to 2 bit, the additional 13*16*575,000 terms took a further 600 MB (on top of the 8-to-4-bit step; the growth is linear in the number of additional terms).
The performance impact was negligible (or rather, the standard deviation was bigger than the performance gain), so I still think 8 bit is a good idea, and its index size is the smallest.
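To make the term-count arithmetic explicit (a back-of-the-envelope sketch; the figures 13 fields, 575,000 documents, and the extra precision levels per step come from the measurements above):

```java
// Back-of-the-envelope check of the extra-term counts quoted above.
public class TrieTermCount {
    static long extraTerms(int fields, int extraPrecisionsPerValue, long docs) {
        return (long) fields * extraPrecisionsPerValue * docs;
    }

    public static void main(String[] args) {
        long docs = 575000L;
        // Halving the precision step from 8 bit to 4 bit doubles the number
        // of precision levels per 64-bit value, adding 8 extra levels:
        long step8to4 = extraTerms(13, 8, docs);   // 59,800,000 extra terms -> ~300 MB
        // Going from 4 bit to 2 bit adds 16 more levels on top of that:
        long step4to2 = extraTerms(13, 16, docs);  // 119,600,000 extra terms -> ~600 MB
        System.out.println(step8to4 + " " + step4to2);
    }
}
```

That works out to roughly 5 bytes of index growth per extra term in both steps, which is why the growth looks linear in the number of additional terms.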
My idea why this is so: using fewer bits increases the number of terms in the index (I do not know how much this decreases performance by itself) and needs more TermEnum seeks (for each trie precision, 2 seek operations are needed). These TermEnum seeks are faster for the 8 bit trie variant, because the 2-char prefix can be used extensively (first char = precision marker, second char = first byte of the numeric value). With 4 bit and 2 bit you get only 4 or 2 bits in addition to the precision marker, so seeks in the TermEnum are slower.
In comparison, a ConstantScoreRangeQuery on the full-precision trie-coded values took about 10-20 times longer. Here are the numbers:
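To illustrate the idea of precision-prefixed terms (a simplified sketch, not the actual trie encoding code: the marker characters and the hex representation of the value bytes are illustrative assumptions):

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of 8-bit trie term generation: each term carries a one-char
// precision marker followed by the value's remaining high bytes (here
// rendered as hex). A 64-bit value yields 8 terms, one per precision.
// The 2-char prefix (marker + first value byte) is what lets a TermEnum
// seek narrowly in the 8 bit variant.
public class TrieSketch {
    static List<String> trieTerms8bit(long value) {
        List<String> terms = new ArrayList<>();
        for (int droppedBytes = 0; droppedBytes < 8; droppedBytes++) {
            // Drop 8 more low-order bits per precision level:
            long prefix = value >>> (8 * droppedBytes);
            // Precision marker 'a'..'h', then the remaining bytes as hex:
            String hex = String.format("%0" + (2 * (8 - droppedBytes)) + "x", prefix);
            terms.add((char) ('a' + droppedBytes) + hex);
        }
        return terms;
    }
}
```

For example, trieTerms8bit(0x0102030405060708L) starts with "a0102030405060708" (full precision) and ends with "h01" (coarsest precision, only the top byte left).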
For PANGAEA, all queries are half open (we have the problem that the data sets themselves have bounding boxes - minlatitude, maxlatitude, minlongitude, maxlongitude - and the user searches for a bounding box; hits are data sets that intersect the search bounding box). So for a complete lat/lon constraint you have 4 half-open ranges (with the current trie implementation this is not a problem, because each of the two ANDed filters needs just half the number of terms of a full range; the only performance impact is ANDing the two OpenBitSets). So 4 such half-open ranges ANDed together take the following times on the given index after some warmup, including fetching the first 10 documents:
ConstantScoreRange: 1500-4000 ms
TrieRangeQuery 8bit: 40ms-100ms
TrieRangeQuery 4bit: 30ms-80ms
TrieRangeQuery 2bit: 20ms-80ms
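The cost of combining the half-open ranges is just intersecting the per-range bit sets. A minimal illustration, using java.util.BitSet as a stand-in for Lucene's OpenBitSet (the class and method names here are illustrative, not the contrib filter API):

```java
import java.util.BitSet;

// Each BitSet marks the documents matching one half-open range filter;
// the bounding-box result is their word-wise AND.
public class RangeIntersect {
    static BitSet intersect(BitSet... rangeMatches) {
        BitSet result = (BitSet) rangeMatches[0].clone();
        for (int i = 1; i < rangeMatches.length; i++) {
            result.and(rangeMatches[i]);   // cheap word-wise AND per filter
        }
        return result;
    }
}
```

With four constraints (minlatitude, maxlatitude, minlongitude, maxlongitude), intersect() over the four bit sets yields the documents whose bounding boxes intersect the search box; this AND is the only extra cost compared to a single range.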
The old RangeQuery was not tested, because it hits the MaxClauseCount constraint; if that is raised to Integer.MAX_VALUE, it took approx. 1 minute to finish. The numbers are for pure range queries; if you additionally add some search terms, time goes up a little bit. Only TrieRangeQuery was used (so rewriting to constant score); no docs were filtered in the classical way (term queries + filtering of results).
More benchmark results will follow when I finish the benchmarker, but these are real-world examples. The search engine machine was a Sun X4600 with 16 Opteron cores, 64 bit Java 1.5, Sun Java System Web Server, -Xmx8192M (our current configuration). On lower-cost machines like desktop PCs, ConstantScoreRangeQuery takes much more time, but TrieRangeQuery takes approx. the same time, as disk I/O seeks are more the limiting factor. Processor speed and number of processors are not the limiting factor (if only one query is run at a time).