Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-5046

Explore preset dictionaries for CompressingStoredFieldsFormat

Details

    • Improvement
    • Status: Closed
    • Trivial
    • Resolution: Duplicate
    • None
    • None
    • None
    • None
    • New

    Description

      I discussed this possible improvement with Stefan Pohl and Andrzej BiaƂecki at Berlin Buzzwords: By having preset dictionaries (which could be user-provided and/or computed on a per-block basis), decompression could be faster since we would never have to decompress several documents from a block in order to access a single document.

      One drawback is that it would require putting some boundaries in the compressed stream, so it would maybe decrease a little the compression ratio. But then if decompression is faster, we could also afford larger blocks, so I think this is worth exploring.

      Attachments

        Issue Links

          Activity

            People

              jpountz Adrien Grand
              jpountz Adrien Grand
              Votes:
              1 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: