Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.4
    • Component/s: core/index
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      DocValues fields can be very wasteful if you are storing dates (like solr's TrieDateField does if you enable docvalues) and don't actually need all the precision: e.g. "date-only" fields like date of birth with no time component, time fields without milliseconds precision, and so on.

      Ideally we'd compute GCD of all the values to save space (numberOfTrailingZeros is not really enough here), but i think we should at least look for values like 86400000, 3600000, and 1000 to be practical.
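      As a rough illustration of the idea (the values and divisor below are made up for the example, not taken from any patch): if every value in a "date of birth" field is a millisecond timestamp at midnight UTC, all values share the divisor 86400000 (one day), so a codec can store value/86400000 plus the divisor once, and multiply them back at read time:

          long gcd = 86400000L;              // common divisor shared by all values (one day in millis)
          long value = 1367366400000L;       // 2013-05-01T00:00:00Z as epoch millis (~41 bits)
          long stored = value / gcd;         // 15826, fits in 14 bits
          long restored = stored * gcd;      // 1367366400000L again on the read side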

      1. LUCENE-4936.patch
        53 kB
        Robert Muir
      2. LUCENE-4936.patch
        53 kB
        Adrien Grand
      3. LUCENE-4936.patch
        51 kB
        Robert Muir
      4. LUCENE-4936.patch
        49 kB
        Robert Muir
      5. LUCENE-4936.patch
        48 kB
        Adrien Grand
      6. LUCENE-4936.patch
        48 kB
        Adrien Grand
      7. LUCENE-4936.patch
        47 kB
        Adrien Grand
      8. LUCENE-4936.patch
        30 kB
        Adrien Grand
      9. LUCENE-4936.patch
        4 kB
        Robert Muir

        Activity

        Robert Muir added a comment -

        Here's my hack patch... I think we should do something for DiskDV too though... and of course add tests.

        Adrien Grand added a comment -

        Patch:

        • Adds MathUtil.gcd(long, long)
        • Adds "GCD compression" to Lucene42, Disk and CheapBastard.
        • Improves BaseDocValuesFormatTest which almost only tested "TABLE_COMPRESSED" with Lucene42DVF
        • No more attempts to compress storage when the values are known to be dense, such as SORTED ords.

        I measured how much slower doc values indexing is with these new checks, and it is completely unnoticeable with random or dense values since the GCD quickly reaches 1. When the GCD is larger, it only made indexing 2% slower (every doc has a single field which is a NumericDocValuesField). So I think it's fine.
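        As a minimal sketch of the running-GCD idea (illustration only, not the actual MathUtil.gcd or Lucene42 consumer code; it assumes values are not Long.MIN_VALUE, where Math.abs would overflow):

            static long gcd(long a, long b) {      // plain Euclidean gcd on non-negative longs
              while (b != 0) {
                long t = b;
                b = a % b;
                a = t;
              }
              return a;
            }

            static long commonDivisor(long[] values) {
              long g = 0;                          // gcd(0, x) == x, so 0 is the neutral start
              for (long v : values) {
                g = gcd(g, Math.abs(v));           // fold each value into the running gcd
                if (g == 1) {
                  break;                           // no shared divisor, nothing to gain
                }
              }
              return g;
            }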

        Robert Muir added a comment -

        Looks great! I'm glad you were able to make this fast.

        A few ideas:

        • I like the switch with corruption-check on DiskDV. Can we easily integrate this into Lucene42?
        • Can we update the file format docs? (We attempt to describe the numerics strategies succinctly there.)

        I can do a more thorough review and some additional testing later, but this looks awesome.

        Later we should think about a place (maybe in codec file format docs, maybe even NumericDocValuesField?) to add some practical general guidelines for users that might not otherwise be intuitive. Stuff like: if you are putting dates in NumericDV, zero out portions you don't care about (e.g. milliseconds, time, etc.) to save space; indexing as UTC will be a little more efficient than with local offset; etc.
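        A hedged example of that guideline (the field name "dob" and the surrounding Document are just illustrative):

            Document doc = new Document();
            long millis = System.currentTimeMillis();
            long dayOnly = millis - (millis % 86400000L);          // zero out the time-of-day portion
            doc.add(new NumericDocValuesField("dob", dayOnly));    // stored values now share a divisor of 86400000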

        Improves BaseDocValuesFormatTest which almost only tested "TABLE_COMPRESSED" with Lucene42DVF

        Yeah, this is a good catch! We should also maybe open an issue to review DiskDV and try to make it more efficient. Optimizations like TABLE_COMPRESSED don't exist there I think: it could be handy if someone wants e.g. a smallfloat scoring factor. It's nice that this patch provides back compat for DiskDV, but that's not totally necessary in the future if we want to review and rewrite it. In general that codec was just done very quickly and hasn't seen much benchmarking or anything: it could use some work.

        Robert Muir added a comment -

        indexing as UTC will be a little more efficient than with local offset, etc.

        We could probably solve issues like that too (maybe in something like cheap-bastard codec), if we did a first pass to compute min/max and then did GCD only on the deltas from min... right?
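        A small sketch of that two-pass idea (illustration only; it reuses the gcd helper sketched earlier and assumes the values are close enough together that v - min does not overflow):

            static long gcdOfDeltas(long[] values) {
              long min = Long.MAX_VALUE;
              for (long v : values) {
                min = Math.min(min, v);            // first pass: find the minimum
              }
              long g = 0;
              for (long v : values) {
                g = gcd(g, v - min);               // second pass: gcd of deltas from the minimum
                if (g == 1) {
                  break;                           // the constant offset is gone, but no divisor remains
                }
              }
              return g;
            }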

        Uwe Schindler added a comment -

        indexing as UTC will be a little more efficient than with local offset, etc.

        If you use a NumericField and store the long epoch in it, it is UTC, as there is no localization involved.

        Robert Muir added a comment -

        But NumericField is totally unrelated to docvalues!

        Besides, delta+GCD has other applications than just GMT offsets, e.g. Solr's CurrencyField (at least in the US, people love prices like 199, 299, 399...): in that case it would save 9 bits per value, where it would do nothing with the current patch.

        I'm not arguing that the extra pass should be in the default codec, I just said it might be interesting for cheap-bastard or something.

        Robert Muir added a comment -

        Sorry, I was thinking about car prices when I said 9bpv, but you get the drift.

        Uwe Schindler added a comment -

        But NumericField is totally unrelated to docvalues!

        That's clear. I just said that if you use a LONG docvalues field and store the long epoch, it's always timezone-less. That was what I wanted to say. This applies to Solr, too.

        Adrien Grand added a comment -

        New patch:

        • Computes the GCD based on deltas in order to be able to compress non-UTC dates.
        • Adds support for TABLE_COMPRESSED to DiskDVF.
        • Adds tests that ensure that these new compression methods are actually used whenever applicable.
        • Adds a quick description of the compression method to Lucene42DVF javadocs.
        Uwe Schindler added a comment -

        This is really cool! I have not completely reviewed it yet, but I like that it is included in the default codec, so the user does not need to take care of it.

        Adrien Grand added a comment -

        Thank you Uwe! Unfortunately, I just figured out that the patch is broken when v - minValue overflows (in Consumer.addNumericField). I need to think about a way to fix it...
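        A worked example of the overflow (not from the patch):

            long minValue = Long.MIN_VALUE;
            long v = Long.MAX_VALUE;
            long delta = v - minValue;             // mathematically 2^64 - 1, wraps around to -1 as a long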

        Adrien Grand added a comment - edited

        Here is a work-around for the issue: the consumer stops trying to perform GCD compression as soon as it encounters a value outside the [-MAX_VALUE/2, MAX_VALUE/2] range. This prevents overflows from happening, and I can't think of a reasonable use-case that would benefit from GCD compression and have values outside of this range?

        Robert Muir added a comment -

        Yeah, I think those are ridiculously large numbers.

        I'm gonna try to help review and play with the patch. I think this is great though.

        Adrien Grand added a comment -

        Thank you Robert, I'd love to have a review to make sure the patch is correct, especially for MathUtil.gcd and the DVConsumer.addNumericField logic.

        Robert Muir added a comment -

        The first thing that sticks out is maybe to remove the extra pass? Even though it just pulls the first value...

        For the DiskDV codec I feel it's simple: just remove the loop and look for 'count == 0'. For Lucene42, it's probably best to just add 'count' for the same reason?

        But if it makes things more confusing, maybe just leave it the way it is. It's a little tricky either way.

        Adrien Grand added a comment -

        Simple ideas are often the best ones: the new patch has a single loop! Thanks, Robert!

        Uwe Schindler added a comment -

        Yeah, the duplicate loop was strange; much better now.

        One small thing:

        if (v < - Long.MAX_VALUE / 2 || v > Long.MAX_VALUE / 2) {
        

        It should maybe be changed to the following, which looks more logical to me:

        if (v < Long.MIN_VALUE / 2 || v > Long.MAX_VALUE / 2) {
        
        Robert Muir added a comment -

        Same patch: I just uploaded a test.

        For DiskDVProducer, I think the compression tables and so on should be structured to be in the metadata section rather than the data section, to minimize the per-thread instance overhead.

        Today it's:

            final IndexInput data = this.data.clone();
            data.seek(entry.offset);
        
            final BlockPackedReader reader = new BlockPackedReader(data, entry.packedIntsVersion, entry.blockSize, entry.count, true);
            return new LongNumericDocValues() {
              @Override
              public long get(long id) {
                return reader.get(id);
              }
            };
        

        I'll try to cut this stuff over...

        Robert Muir added a comment -

        Additionally, in all cases we can move the corruption check to where we read the metadata. This means we detect problems earlier, rather than when someone asks for the field for the first time. I'll do this too.

        Robert Muir added a comment -

        OK, patch with these changes: we write all this stuff in the metadata for DiskDV (to reduce overhead), plus stronger/earlier checks on startup (Lucene42, too). In getNumeric() I changed it to an AssertionError since we already checked it at open time.

        Additionally, for all corruption checks, I ensured we did:

        if (something) {
          throw new CorruptIndexException("some message, input=" + meta);
        }
        

        Passing the IndexInput (meta/data) is very useful: if someone hits the problem, they get a file name.

        I'll do some more review later...

        Robert Muir added a comment -

        I will generate backwards indexes from 4.2.0 now and commit them to branch_4x/trunk.

        It hurts nothing even if we don't commit this issue.

        Robert Muir added a comment -

        Oh... I checked, I already did this. So when we commit this one, we can just generate 4.4 indexes.

        I think I generated these when we changed the format initially, so we detect problems sooner rather than later...

        Adrien Grand added a comment -

        +1 to the proposed changes!

        Here is an updated patch that fixes the DVProducer constructors to open the data file and check the header in a try/finally block (so that data files are closed even if the header check fails).
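        As a sketch of the pattern being described (names like dataName, dataCodec and the version constants are placeholders, not the exact patch):

            boolean success = false;
            IndexInput data = null;
            try {
              data = state.directory.openInput(dataName, state.context);
              CodecUtil.checkHeader(data, dataCodec, VERSION_START, VERSION_CURRENT);
              success = true;
            } finally {
              if (!success) {
                IOUtils.closeWhileHandlingException(data);   // don't leak the file handle if the check throws
              }
            }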

        Robert Muir added a comment -

        +1: nice catch, Adrien.

        Why do we have this logic for TABLE_COMPRESSED in the DiskDV consumer?

            if (uniqueValues != null
                && ((maxValue - minValue) < 0L || (maxValue - minValue) > 256)
                && count <= Integer.MAX_VALUE) {
        

        Shouldn't this just be:

            if (uniqueValues != null && count <= Integer.MAX_VALUE) {
        

        We only care about the number of unique values; in this case it does not matter what they actually are.

        Adrien Grand added a comment -

        I guess the point was to avoid one level of indirection in case all values can be stored using a single byte. Maybe "(maxValue - minValue) > 256" should be replaced with "(maxValue - minValue) >= uniqueValues.size()"? This would ensure that table compression isn't used if values are already dense?

        Robert Muir added a comment -

        I see. In this case should we just take bitsRequired on both sides?
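        A rough sketch of that comparison, reusing the uniqueValues/minValue/maxValue/count names from the snippet above (not the committed code):

            long range = maxValue - minValue;
            boolean preferTable = uniqueValues != null
                && count <= Integer.MAX_VALUE
                && range >= 0                                            // guard against overflow of max - min
                && PackedInts.bitsRequired(uniqueValues.size() - 1)      // bits per ordinal into the table
                     < PackedInts.bitsRequired(range);                   // bits per raw delta from min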

        Adrien Grand added a comment -

        One advantage of DELTA_COMPRESSED is that it uses different numbers of bits per value per block. Even if max-min=200, it could still happen that most blocks only require 6 or 7 bits per value. If there are many blocks, this could save substantial disk/memory.

        Robert Muir added a comment -

        Only in the case of outliers... but TABLE could do this too, if it sorted its table by increasing frequency of occurrence.

        Adrien Grand added a comment -

        In this case should we just take bitsRequired on both sides?

        Yes, this makes sense!

        Robert Muir added a comment -

        (decreasing rather)

        Robert Muir added a comment -

        OK, I added the bitsRequired check in this patch... I think we are good to go here.

        +1 to commit

        Commit Tag Bot added a comment -

        [trunk commit] jpountz
        http://svn.apache.org/viewvc?view=revision&revision=1470948

        LUCENE-4936: Improve numeric doc values compression (for dates especially).

        Commit Tag Bot added a comment -

        [branch_4x commit] jpountz
        http://svn.apache.org/viewvc?view=revision&revision=1470953

        LUCENE-4936: Improve numeric doc values compression (for dates especially).

        Steve Rowe added a comment -

        Bulk close resolved 4.4 issues


          People

          • Assignee:
            Adrien Grand
            Reporter:
            Robert Muir
          • Votes:
            0
            Watchers:
            1
