  1. Jackrabbit Oak
  2. OAK-6806

RDBDocumentStore: version GC does not scale when there are many docs with long paths

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.7.14, 1.8.0
    • Component/s: rdbmk
    • Labels:
      None

      Description

      Due to the way the RDB RevisionGC looks for split documents, it doesn't perform well at all in the presence of many documents with long paths.

      The reason is that we currently do not have a column for SDTYPE, and thus use pattern matching on the document IDs instead. However, once a document with a long path is in the document store, it appears to the GC as a candidate split document, and thus is always read upon GC.
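
      The effect described above can be illustrated with a minimal sketch. It is not the RDBDocumentStore code or its SQL; the class, the Row type, the ID formats (":p" marking a split document, ":h" marking a hashed long path) and the SDTYPE value are all assumptions made up for the illustration. It only shows why an ID pattern that is wide enough to catch split documents can also match every long-path document, while a dedicated SDTYPE column does not.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

/** Illustrative only: hypothetical names and ID formats, not Oak's actual classes or queries. */
final class SplitDocCandidateSketch {

    /** Hypothetical row: a document ID plus an optional split-document type. */
    static final class Row {
        final String id;
        final Integer sdType; // null means "not a split document" in this sketch

        Row(String id, Integer sdType) {
            this.id = id;
            this.sdType = sdType;
        }
    }

    // Assumption for this sketch: after the depth prefix, "p" marks a split
    // document and "h" marks a hashed (long) path. A pattern that has to cover
    // both cannot tell an ordinary long-path document from a split document.
    private static final Pattern CANDIDATE_BY_ID = Pattern.compile("^\\d+:[ph].*");

    /** Candidate selection by ID pattern: every match must be read and inspected. */
    static List<Row> candidatesById(List<Row> rows) {
        List<Row> result = new ArrayList<>();
        for (Row row : rows) {
            if (CANDIDATE_BY_ID.matcher(row.id).matches()) {
                result.add(row);
            }
        }
        return result;
    }

    /** Candidate selection by a dedicated SDTYPE value: only real split documents match. */
    static List<Row> candidatesBySdType(List<Row> rows) {
        List<Row> result = new ArrayList<>();
        for (Row row : rows) {
            if (row.sdType != null) {
                result.add(row);
            }
        }
        return result;
    }
}
```

      With one ordinary document, one split document and two long-path documents, candidatesById returns three rows and candidatesBySdType returns one. In this sketch the number of documents read by the ID-based selection grows with the number of long-path documents, not with the number of split documents.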

        Attachments

        1. OAK-6806.diff
          1 kB
          Julian Reschke


              People

              • Assignee:
                reschke Julian Reschke
                Reporter:
                reschke Julian Reschke
