Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4936

docvalues date compression

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 4.4
    • core/index
    • None
    • New

    Description

      DocValues fields can be very wasteful if you are storing dates (like solr's TrieDateField does if you enable docvalues) and don't actually need all the precision: e.g. "date-only" fields like date of birth with no time component, time fields without milliseconds precision, and so on.

      Ideally we'd compute GCD of all the values to save space (numberOfTrailingZeros is not really enough here), but i think we should at least look for values like 86400000, 3600000, and 1000 to be practical.

      Attachments

        1. LUCENE-4936.patch
          4 kB
          Robert Muir
        2. LUCENE-4936.patch
          30 kB
          Adrien Grand
        3. LUCENE-4936.patch
          47 kB
          Adrien Grand
        4. LUCENE-4936.patch
          48 kB
          Adrien Grand
        5. LUCENE-4936.patch
          48 kB
          Adrien Grand
        6. LUCENE-4936.patch
          49 kB
          Robert Muir
        7. LUCENE-4936.patch
          51 kB
          Robert Muir
        8. LUCENE-4936.patch
          53 kB
          Adrien Grand
        9. LUCENE-4936.patch
          53 kB
          Robert Muir

        Activity

          People

            jpountz Adrien Grand
            rcmuir Robert Muir
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: