Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-3893

SegmentWriter records cache could use thinner keys

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Later
    • None
    • None
    • segment-tar

    Description

      The SegmentWriter keeps a records deduplication cache ('records' map) that maintains 2 types of mappings:

      • template -> recordid
      • strings -> recordid

      For the first one (template-> recordid) we can come up with a thinner representation of a template (a hash function that is fast and not very collision prone) so we don't have to keep a reference to each template object.

      Same applies for second one, similar to what is happening in the StringsCache now, we could keep the string value up to a certain size and beyond that, hash it and use that for the deduplication map.

      Attachments

        1. OAK-3893.patch
          7 kB
          Alex Deparvu

        Issue Links

          Activity

            People

              stillalex Alex Deparvu
              stillalex Alex Deparvu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: