Uploaded image for project: 'Apache Avro'
  1. Apache Avro
  2. AVRO-679

Improved encodings for arrays

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Minor
    • Resolution: Incomplete
    • None
    • None
    • spec
    • None

    Description

      There are better ways to encode arrays of varints [1] which are faster to decode, and more space efficient than encoding varints independently.

      Extending the idea to other types of variable length data like 'bytes' and 'string', you could encode the entries for an array block as an array of lengths, followed by contiguous byte/utf8 data.

      [1] group varint encoding: slides 57-63 of http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/people/jeff/WSDM09-keynote.pdf

      Attachments

        Activity

          People

            Unassigned Unassigned
            stuhood Stu Hood
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: