Lucene - Core / LUCENE-7371

BKDReader could compress values better

Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 6.2, 7.0
    • Component/s: None
    • Labels: None
    • Lucene Fields: New

    Description

      To compress values, BKDReader relies only on shared prefixes within a block. We could probably do better quite easily. For instance, the first byte of the dimension that the values are sorted by has only 256 possible values, yet we use a block size of 1024, so in a sorted block each distinct first byte is shared by at least 4 values on average. By using something simple like run-length compression, we could save 6 bits per value on average.
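To make the idea concrete, here is a minimal sketch of run-length encoding the first bytes of a sorted block. It is only an illustration, not the code from the attached patches: the class name `FirstByteRunLength`, the (value, run length - 1) pair layout, and the 256-value run cap are all assumptions made for this example.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;

// Hypothetical helper, not part of Lucene: run-length encodes the first byte
// of each value in a block that is already sorted on that byte.
final class FirstByteRunLength {

  // Writes one (value, runLength - 1) byte pair per run; runs are capped at 256
  // so that the length always fits in a single byte.
  static byte[] encode(byte[] sortedFirstBytes) {
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    int i = 0;
    while (i < sortedFirstBytes.length) {
      byte value = sortedFirstBytes[i];
      int runLength = 1;
      while (i + runLength < sortedFirstBytes.length
          && sortedFirstBytes[i + runLength] == value
          && runLength < 256) {
        runLength++;
      }
      out.write(value);          // the shared first byte
      out.write(runLength - 1);  // run length minus one, in 0..255
      i += runLength;
    }
    return out.toByteArray();
  }

  // Expands the (value, runLength - 1) pairs back into `count` bytes.
  static byte[] decode(byte[] encoded, int count) {
    byte[] result = new byte[count];
    int pos = 0;
    for (int i = 0; i < encoded.length; i += 2) {
      int runLength = (encoded[i + 1] & 0xFF) + 1;
      Arrays.fill(result, pos, pos + runLength, encoded[i]);
      pos += runLength;
    }
    return result;
  }
}
```

With this naive layout, a block of 1024 values holding 256 runs of 4 encodes to 512 bytes, i.e. 4 bits per value instead of 8. A tighter encoding of the run lengths would be needed to approach the 6 bits per value mentioned above.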

      Attachments

        1. LUCENE-7371.patch
          23 kB
          Adrien Grand
        2. LUCENE-7371.patch
          22 kB
          Adrien Grand
        3. LUCENE-7371.patch
          17 kB
          Adrien Grand

          People

            Assignee: Adrien Grand (jpountz)
            Reporter: Adrien Grand (jpountz)
            Votes: 0
            Watchers: 3
