Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: master (7.0), 6.2
    • Component/s: None
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      For compressing values, BKDReader only relies on shared prefixes in a block. We could probably easily do better. For instance there are only 256 possible values for the first byte of the dimension that the values are sorted by, yet we use a block size of 1024. So by using something simple like run-length compression we could save 6 bits per value on average.
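To make the idea concrete, here is a toy run-length encoder for the leading byte of a sorted block, with runs capped so each length fits in one byte. This is a hypothetical sketch (class and method names are invented), not the actual BKDWriter/BKDReader code:

```java
import java.util.ArrayList;
import java.util.List;

// Toy run-length encoder for the leading byte of values in a sorted block.
// Hypothetical sketch; not the actual Lucene BKD code.
public class RunLengthSketch {

  // Encode as (byteValue, runLength) pairs. Runs are capped at 255 so the
  // length always fits in a single byte.
  static List<int[]> encode(byte[] leadingBytes) {
    List<int[]> runs = new ArrayList<>();
    int i = 0;
    while (i < leadingBytes.length) {
      int j = i;
      while (j < leadingBytes.length
          && leadingBytes[j] == leadingBytes[i]
          && j - i < 255) {
        j++;
      }
      runs.add(new int[] { leadingBytes[i] & 0xFF, j - i });
      i = j;
    }
    return runs;
  }

  public static void main(String[] args) {
    // A sorted block of 1024 values has at most 256 distinct leading bytes,
    // so runs are long and the run-length form is much smaller than 1 byte
    // per value.
    byte[] block = new byte[1024];
    for (int i = 0; i < block.length; i++) {
      block[i] = (byte) (i / 8); // 128 distinct leading bytes, runs of 8
    }
    List<int[]> runs = RunLengthSketch.encode(block);
    int encodedBytes = runs.size() * 2; // 1 byte value + 1 byte run length
    System.out.println(runs.size());    // 128 runs
    System.out.println(encodedBytes);   // 256 bytes vs 1024 raw
  }
}
```

In this dense example the leading-byte column shrinks from 1024 bytes to 256, in the spirit of the "save 6 bits per value" estimate above.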

      1. LUCENE-7371.patch
        23 kB
        Adrien Grand
      2. LUCENE-7371.patch
        22 kB
        Adrien Grand
      3. LUCENE-7371.patch
        17 kB
        Adrien Grand

        Activity

        jpountz Adrien Grand added a comment -

        Here is a patch. On the IndexAndSearchOpenStreetMaps benchmark, it saves 9.7% storage (577MB -> 521MB). The "distance" and "poly 5" benchmarks report the same response times, but the "box" benchmark is about 4% slower.

        jpountz Adrien Grand added a comment -

        I first thought the issue was with inlining, since the methods have many arguments and I had made them bigger, but it turned out that the main issue was branch misprediction due to the use of vints for encoding the run length: runs alternate almost every time between being less and greater than 127 (the boundary between 1 and 2 bytes with vints). So I capped the run length to 256 in order to be able to use one byte for run lengths all the time, and things are now faster with compression (about 75.1 QPS on master and 78.2 QPS with the patch, a 4% improvement). Disk savings are similar to the previous iteration of the patch: the index is now 522MB on disk vs. 521MB before.
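The branch in question is the data-dependent one inside a variable-length integer writer: values below 128 take one byte, larger values take more, so run lengths hovering around 127 defeat the branch predictor. A toy sketch of that shape (hypothetical code, not Lucene's DataOutput):

```java
import java.io.ByteArrayOutputStream;

// Toy vint writer illustrating the data-dependent branch that the patch
// avoids by always writing the run length as a single byte.
// Hypothetical sketch; not Lucene's DataOutput.writeVInt.
public class VIntSketch {

  static void writeVInt(ByteArrayOutputStream out, int v) {
    while ((v & ~0x7F) != 0) {         // branch taken iff v >= 128
      out.write((v & 0x7F) | 0x80);    // low 7 bits, continuation bit set
      v >>>= 7;
    }
    out.write(v);                      // final byte, continuation bit clear
  }

  public static void main(String[] args) {
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    writeVInt(out, 100);               // 1 byte
    writeVInt(out, 200);               // 2 bytes
    System.out.println(out.size());    // 3
  }
}
```

With run lengths alternating across the 127 boundary, that `while` condition flips on nearly every call; a fixed one-byte length has no such branch.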

        mikemccand Michael McCandless added a comment -

        This is a nice optimization! Patch looks good!

        The BKDWriter change to pick which dimension to apply the run-length coding to is best effort, right? Because you could have a dim with fewer unique leading suffix bytes but a larger delta between first and last values? But it would take quite a bit more work at indexing time to figure that out ... maybe add a comment explaining this tradeoff? It seems likely the "min delta" approach should work well in practice, but have you tried the slow-but-correct approach to verify?

        Also, I noticed TestBackwardsCompatibility seems not to test points! I'll go fix that ...
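For context, the "min delta" heuristic being discussed might look like the following sketch: pick the dimension whose leading suffix byte spans the smallest range between the block's first and last values, since in a sorted block that range upper-bounds the number of runs. This is a hypothetical illustration (names invented), not the actual patch:

```java
// Hypothetical sketch of the "min delta" heuristic: choose the dimension
// whose leading byte changes the least between the block's first and last
// values, as a cheap upper bound on the number of runs.
// Not the actual BKDWriter code.
public class PickDimSketch {

  // firstValue[dim] / lastValue[dim] hold the per-dimension bytes of the
  // block's first and last values (after shared-prefix removal).
  static int pickRunLenDim(byte[][] firstValue, byte[][] lastValue) {
    int best = 0;
    int bestDelta = Integer.MAX_VALUE;
    for (int dim = 0; dim < firstValue.length; dim++) {
      int delta = (lastValue[dim][0] & 0xFF) - (firstValue[dim][0] & 0xFF);
      if (delta < bestDelta) {
        bestDelta = delta;
        best = dim;
      }
    }
    return best;
  }
}
```

As Mike notes, this is best effort: a dimension could have a small delta yet more distinct leading bytes in between than another dimension, and counting the actual runs per dimension would cost more at indexing time.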

        jpountz Adrien Grand added a comment -

        Good point, here is an updated patch. I am getting the following indexing times, which suggests that it does not hurt:

        278.064423505 sec to index part 0 // master
        270.492947789 sec to index part 0 // patch
        

        The index size is also unchanged, which was expected since the previous heuristic should be equivalent with dense data.

        Also, I noticed TestBackwardsCompatibility seems not to test points! I'll go fix that ...

        Ouch, good catch. Thanks!

        mikemccand Michael McCandless added a comment -

        +1, great!

        jira-bot ASF subversion and git services added a comment -

        Commit 866398bea67607bcd54331a48736e6bdb94a703d in lucene-solr's branch refs/heads/master from Adrien Grand
        [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=866398b ]

        LUCENE-7371: Better compression of values in Lucene60PointsFormat.

        jira-bot ASF subversion and git services added a comment -

        Commit 1f446872aa9346c22643d0fb753ec42942b5a4d2 in lucene-solr's branch refs/heads/branch_6x from Adrien Grand
        [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=1f44687 ]

        LUCENE-7371: Better compression of values in Lucene60PointsFormat.

        jira-bot ASF subversion and git services added a comment -

        Commit 1a6df249f91ca9f4dab792c48f5965f3388f1776 in lucene-solr's branch refs/heads/branch_6x from Adrien Grand
        [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=1a6df24 ]

        LUCENE-7371: Fix CHANGES entry.

        jira-bot ASF subversion and git services added a comment -

        Commit b54d46722b36f107edd59a8d843b93f5727a9058 in lucene-solr's branch refs/heads/master from Adrien Grand
        [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=b54d467 ]

        LUCENE-7371: Fix CHANGES entry.

        jpountz Adrien Grand added a comment -

        Thanks Mike!

        jpountz Adrien Grand added a comment -

        The nightly benchmarks are reporting interesting changes. Some queries seem to perform slightly faster now, like IntNRQ (http://people.apache.org/~mikemccand/lucenebench/IntNRQ.html) or the geo3d distance filter (http://people.apache.org/~mikemccand/geobench.html#search-distance), but others seem a bit slower, like the 10-gon filter (http://people.apache.org/~mikemccand/geobench.html#search-poly_10) or the 10 nearest points (http://people.apache.org/~mikemccand/geobench.html#search-nearest_10). I think the fact that it is not consistently slower or faster is due to the distribution of points in the blocks that need to be read (the more unique leading bytes, the more expensive the read). Given that the slowdown is not common to all benchmarks and that the size reduction is significant, I don't think this should be reverted, but let me know if you think otherwise. (For the record, many benchmarks look slower on July 17th, but I don't think that is related to this change; for instance, even phrases got slower: http://people.apache.org/~mikemccand/lucenebench/Phrase.html)

        rcmuir Robert Muir added a comment -

        I think Michael McCandless may have upgraded his operating system.

        mikemccand Michael McCandless added a comment -

        Oh sorry, I upgraded the Linux kernel from 4.4 -> 4.6.4 on 7/17! I'll add an annotation.

        mikemccand Michael McCandless added a comment -

        Bulk close resolved issues after 6.2.0 release.


          People

          • Assignee:
            jpountz Adrien Grand
            Reporter:
            jpountz Adrien Grand
          • Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue
