Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-6806

RDBDocumentStore: version GC does not scale when there are many docs with long paths

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.7.14, 1.8.0
    • rdbmk
    • None

    Description

      Due to the way the RDB RevisionGC looks for split documents, it doesn't perform well at all in the presence of many documents with long paths.

      The reason is that we currently do not have a column for SDTYPE, and thus use pattern matching on the document IDs instead. However, once a document has long path in the document store, it appears to the GC has candidate split document, and thus is always read upon GC.

      Attachments

        1. OAK-6806.diff
          1 kB
          Julian Reschke

        Issue Links

          Activity

            People

              reschke Julian Reschke
              reschke Julian Reschke
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: