Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
Description
The blob store garbage collection (data store garbage collection) uses the chunk ids to identify binaries to be deleted. The blob ids contain the size now (<contentHash>#<size>), and the blob id is currently equal to the chunk id.
It would be more efficient to not use the size, and instead just use the content hash, for the chunk ids. That way, enumerating the entries that are in the store is potentially faster. Also, it allows us to change the blob id in the future, for example add more information to it (for example the creation time, or the first few bytes of the content) if we ever want to.
Attachments
Issue Links
- is related to
-
OAK-1667 Encode Blob length as part of blobId in DataStoreBlobStore
- Closed