Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-3107

SegmentWriter should be able to store blob IDs longer than 4096 bytes

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.0.18, 1.2.3, 1.3.3, 1.4
    • Component/s: None
    • Labels:
      None

      Description

      The SegmentWriter is able to store blob IDs that are no longer than 4096 bytes, but some implementation of BlobStore may return blob IDs 4096 bytes long (or more).

      It should be possible to use a different encoding for long blob IDs. The blob IDs should be written as a string (using SegmentWriter#writeString), and its reference ID embedded into a value record.

      The encoding in this case should be something like the following:

      11110 + 3bit + 3byte
      

      where the three least significant bits of the first bytes are actually unused, and the three bytes are used to store the record ID of the string representing the blob ID.

      This new encoding is necessary to maintain backwards compatibility with the current way of storing blob IDs and to give a way to SegmentBlob to recognise this new encoding.

        Attachments

        1. OAK-3107-01.patch
          17 kB
          Francesco Mari

          Issue Links

            Activity

              People

              • Assignee:
                mduerig Michael Dürig
                Reporter:
                frm Francesco Mari
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: