Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-6806

RDBDocumentStore: version GC does not scale when there are many docs with long paths

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.7.14, 1.8.0
    • Component/s: rdbmk
    • Labels:
      None

      Description

      Due to the way the RDB RevisionGC looks for split documents, it doesn't perform well at all in the presence of many documents with long paths.

      The reason is that we currently do not have a column for SDTYPE, and thus use pattern matching on the document IDs instead. However, once a document has long path in the document store, it appears to the GC has candidate split document, and thus is always read upon GC.

        Attachments

        Issue Links

          Activity

            People

            • Assignee:
              reschke Julian Reschke
              Reporter:
              reschke Julian Reschke

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment