Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4609

Write a PackedIntsEncoder/Decoder for facets

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Not A Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: modules/facet
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      Today the facets API lets you write IntEncoder/Decoder to encode/decode the category ordinals. We have several such encoders, including VInt (default), and block encoders.

      It would be interesting to implement and benchmark a PackedIntsEncoder/Decoder, with potentially two variants: (1) receives bitsPerValue up front, when you e.g. know that you have a small taxonomy and the max value you can see and (2) one that decides for each doc on the optimal bitsPerValue, writes it as a header in the byte[] or something.

        Attachments

        1. SemiPackedEncoder.patch
          12 kB
          Gilad Barkai
        2. LUCENE-4609.patch
          75 kB
          Adrien Grand
        3. LUCENE-4609.patch
          8 kB
          Michael McCandless
        4. LUCENE-4609.patch
          7 kB
          Michael McCandless
        5. LUCENE-4609.patch
          8 kB
          Michael McCandless
        6. LUCENE-4609.patch
          9 kB
          Gilad Barkai

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                shaie Shai Erera
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: