[OAK-3107] SegmentWriter should be able to store blob IDs longer than 4096 bytes - ASF JIRA

Attach files

Attach Screenshot

Voters

Watch issue

Watchers

Create sub-task

Link

Clone

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.0.18, 1.2.3, 1.3.3, 1.4
Component/s: None
Labels:
None

Description

The SegmentWriter is able to store blob IDs that are no longer than 4096 bytes, but some implementation of BlobStore may return blob IDs 4096 bytes long (or more).

It should be possible to use a different encoding for long blob IDs. The blob IDs should be written as a string (using SegmentWriter#writeString), and its reference ID embedded into a value record.

The encoding in this case should be something like the following:

11110 + 3bit + 3byte

where the three least significant bits of the first bytes are actually unused, and the three bytes are used to store the record ID of the string representing the blob ID.

This new encoding is necessary to maintain backwards compatibility with the current way of storing blob IDs and to give a way to SegmentBlob to recognise this new encoding.

Attachments

OAK-3107-01.patch
16/Jul/15 07:03
17 kB
Francesco Mari

Issue Links

Add Link

blocks

OAK-1842 ISE: "Unexpected value record type: f2" is thrown when FileBlobStore is used

Closed

Delete this link

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: Michael Dürig

Reporter:: Francesco Mari

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 15/Jul/15 09:45

Updated:: 02/Mar/16 11:55

Resolved:: 17/Jul/15 14:19

Agile

View on Board

SegmentWriter should be able to store blob IDs longer than 4096 bytes

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

Agile

Slack

Issue deployment