[LUCENE-328] Some utilities for a compact sparse filter - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Minor
Resolution: Duplicate
Affects Version/s: None
Fix Version/s: None
Component/s: core/search
Labels:
None
Environment:

Operating System: other
Platform: Other

Bugzilla Id:
32921

Description

Two files are attached that might form the basis for an alternative
filter implementation that is more memory efficient than one bit
per doc when less than about 1/8 of the docs pass through the filter.

The document numbers are stored in RAM as VInt's from the Lucene index
format. These VInt's encode the difference between two successive
document numbers, much like a PositionDelta in the Positions:
http://jakarta.apache.org/lucene/docs/fileformats.html

The getByteSize() method can be used to verify the compression
once a SortedVIntList is constructed.
The precise conditions under which this is more memory efficient than
one bit per document are not easy to specify in advance.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

SkipFilter1.patch
15/May/06 23:17
4 kB
Paul Elschot
IntArraySortedIntList.java
22/Nov/05 07:07
3 kB
Mark Harwood
ASF.LICENSE.NOT.GRANTED--TestSortedVIntList.java
06/Jan/05 03:21
4 kB
Paul Elschot
ASF.LICENSE.NOT.GRANTED--TestSortedVIntList.java
17/Jan/05 02:06
4 kB
Paul Elschot
ASF.LICENSE.NOT.GRANTED--TestSortedVIntList.java
09/Feb/05 04:46
4 kB
Paul Elschot
ASF.LICENSE.NOT.GRANTED--TestDocNrSkippers.java
21/Jun/05 18:21
6 kB
Mark Harwood
ASF.LICENSE.NOT.GRANTED--TestDocNrSkippers.java
21/Jun/05 19:53
6 kB
Mark Harwood
ASF.LICENSE.NOT.GRANTED--SortedVIntList.java
04/Jan/05 00:40
4 kB
Paul Elschot
ASF.LICENSE.NOT.GRANTED--SortedVIntList.java
17/Jan/05 02:04
4 kB
Paul Elschot
ASF.LICENSE.NOT.GRANTED--SortedVIntList.java
09/Feb/05 04:44
4 kB
Paul Elschot
ASF.LICENSE.NOT.GRANTED--OrDocNrSkipper.java
21/Jun/05 18:20
2 kB
Mark Harwood
ASF.LICENSE.NOT.GRANTED--OrDocNrSkipper.java
21/Jun/05 19:53
2 kB
Mark Harwood
ASF.LICENSE.NOT.GRANTED--IntArraySortedIntList.java
21/Jun/05 18:19
3 kB
Mark Harwood
ASF.LICENSE.NOT.GRANTED--DocNrSkipper.java
17/Jan/05 02:00
1 kB
Paul Elschot
ASF.LICENSE.NOT.GRANTED--DocNrSkipper.java
09/Feb/05 04:43
1 kB
Paul Elschot
ASF.LICENSE.NOT.GRANTED--BitSetSortedIntList.java
21/Jun/05 18:20
1 kB
Mark Harwood
ASF.LICENSE.NOT.GRANTED--AndDocNrSkipper.java
21/Jun/05 18:21
2 kB
Mark Harwood
ASF.LICENSE.NOT.GRANTED--AndDocNrSkipper.java
21/Jun/05 19:52
2 kB
Mark Harwood

Issue Links

is related to

LUCENE-584 Decouple Filter from BitSet

Closed

Activity

People

Assignee:: Unassigned

Reporter:: Paul Elschot

Votes:: 5 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 04/Jan/05 00:38

Updated:: 28/Aug/22 11:20

Resolved:: 28/Jun/06 01:38