Lucene - Core
  1. Lucene - Core
  2. LUCENE-4609

Write a PackedIntsEncoder/Decoder for facets

    Details

    • Type: New Feature New Feature
    • Status: Resolved
    • Priority: Minor Minor
    • Resolution: Not A Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: modules/facet
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      Today the facets API lets you write IntEncoder/Decoder to encode/decode the category ordinals. We have several such encoders, including VInt (default), and block encoders.

      It would be interesting to implement and benchmark a PackedIntsEncoder/Decoder, with potentially two variants: (1) receives bitsPerValue up front, when you e.g. know that you have a small taxonomy and the max value you can see and (2) one that decides for each doc on the optimal bitsPerValue, writes it as a header in the byte[] or something.

      1. LUCENE-4609.patch
        75 kB
        Adrien Grand
      2. LUCENE-4609.patch
        8 kB
        Michael McCandless
      3. LUCENE-4609.patch
        7 kB
        Michael McCandless
      4. LUCENE-4609.patch
        8 kB
        Michael McCandless
      5. LUCENE-4609.patch
        9 kB
        Gilad Barkai
      6. SemiPackedEncoder.patch
        12 kB
        Gilad Barkai

        Issue Links

          Activity

          Shai Erera created issue -
          Shai Erera made changes -
          Field Original Value New Value
          Summary Writer a PackedIntsEncoder/Decoder for facets Write a PackedIntsEncoder/Decoder for facets
          Gilad Barkai made changes -
          Attachment LUCENE-4609.patch [ 12561747 ]
          Adrien Grand made changes -
          Link This issue is related to LUCENE-4643 [ LUCENE-4643 ]
          Michael McCandless made changes -
          Attachment LUCENE-4609.patch [ 12565186 ]
          Michael McCandless made changes -
          Attachment LUCENE-4609.patch [ 12565962 ]
          Michael McCandless made changes -
          Attachment LUCENE-4609.patch [ 12566012 ]
          Adrien Grand made changes -
          Attachment LUCENE-4609.patch [ 12566641 ]
          Gilad Barkai made changes -
          Attachment SemiPackedEncoder.patch [ 12568423 ]
          Shai Erera made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Lucene Fields New [ 10121 ] New,Patch Available [ 10121, 10120 ]
          Resolution Not A Problem [ 8 ]

            People

            • Assignee:
              Unassigned
              Reporter:
              Shai Erera
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development