Details
- Type: Task
- Status: Closed
- Priority: Major
- Resolution: Fixed
Description
Right now, the DocumentMK's version garbage collection runs in several phases.
It first collects the paths of candidate nodes, and only once this has been successfully finished, starts actually deleting nodes.
This can be a problem when the regularly scheduled garbage collection is interrupted during the path collection phase, for example by other maintenance tasks. On the next run, the number of paths to be collected will be even larger, making that run even more likely to fail.
We should think about a change in the logic that would allow the GC to run in chunks; maybe by partitioning the path space by top level directory.
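To illustrate the idea, here is a minimal in-memory sketch of partitioning by top-level directory. It is not the Oak implementation: the class `ChunkedGcSketch`, its methods, and the `maxChunks` parameter (which simulates an interruption) are hypothetical names made up for this example, and a plain `Set<String>` of candidate paths stands in for the document store.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical sketch, not Oak code: collect and delete garbage one chunk at a time.
final class ChunkedGcSketch {

    // First path segment, e.g. "/content/a/b" -> "/content"; the root maps to "/".
    public static String topLevel(String path) {
        if (path.length() <= 1) {
            return "/";
        }
        int second = path.indexOf('/', 1);
        return second < 0 ? path : path.substring(0, second);
    }

    // Groups candidate paths by top-level directory; TreeMap gives a stable chunk order.
    public static Map<String, List<String>> partition(Iterable<String> candidates) {
        Map<String, List<String>> chunks = new TreeMap<>();
        for (String p : candidates) {
            chunks.computeIfAbsent(topLevel(p), k -> new ArrayList<>()).add(p);
        }
        return chunks;
    }

    // Processes at most maxChunks chunks (simulating an interruption after that).
    // Each chunk is deleted as soon as it has been collected, so the work done
    // for finished chunks is kept. Returns the number of deleted paths.
    public static int collectInChunks(Set<String> candidates, int maxChunks) {
        int deleted = 0;
        int done = 0;
        // Partition a copy so that removing from 'candidates' is safe while iterating.
        for (List<String> chunk : partition(new ArrayList<>(candidates)).values()) {
            if (done == maxChunks) {
                break;
            }
            for (String p : chunk) {
                if (candidates.remove(p)) {
                    deleted++;
                }
            }
            done++;
        }
        return deleted;
    }

    public static void main(String[] args) {
        Set<String> candidates =
                new TreeSet<>(Arrays.asList("/a/x", "/a/y", "/b/z", "/c"));
        // First run is "interrupted" after one chunk; its deletions are kept.
        System.out.println("first run deleted: " + collectInChunks(candidates, 1));
        // The next run only has to deal with the remaining chunks.
        System.out.println("second run deleted: " + collectInChunks(candidates, 10));
    }
}
```

The point of the sketch is that an interrupted run still shrinks the backlog, because each chunk's deletions are applied before the next chunk is collected. Whether top-level directories yield reasonably balanced chunks depends on the repository layout, which this example does not address.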
Attachments
Issue Links
- is blocked by
  - OAK-6020 add a Guava Stopwatch like duration formatter (Closed)
- is related to
  - OAK-8448 VersionGC may get stuck at 60s scope (Closed)
  - OAK-5898 Revision GC command line tool (Closed)
  - OAK-3070 Use a lower bound in VersionGC query to avoid checking unmodified once deleted docs (Closed)
  - OAK-5571 VersionGarbageCollector can remove leaf nodes eagerly (Closed)
  - OAK-6083 RDBDocumentStore: implement support for VersionGCSupport extensions added for OAK-4780 (Closed)
  - OAK-6059 Revert VersionGC supplier change (Closed)