Details
- Improvement
- Status: Open
- Major
- Resolution: Unresolved
- 1.0.0
- None
- None
Description
KUDU-236 made it possible to remove rows that were deleted prior to the ancient history mark, but we still don't actively schedule compactions based on deleted rows. For example, if we have a fully compacted table and issue a DELETE for every row, the on-disk data size does not change, because no compaction is triggered.
We need some way to notice that the ratio of deleted rows to total rows in a rowset is high, and then decide to compact those rowsets.
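As a rough illustration of the idea, and not Kudu's actual maintenance-manager code, the following Python sketch selects rowsets whose fraction of deleted rows is high. The names `RowSetStats`, `deleted_ratio`, `pick_rowsets_for_delete_gc`, and the `min_ratio` threshold are all hypothetical, and the sketch assumes that per-rowset counts of rows deleted before the ancient history mark are available.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RowSetStats:
    """Hypothetical per-rowset statistics; not a real Kudu structure."""
    name: str
    total_rows: int            # rows stored in the rowset's base data
    ancient_deleted_rows: int  # rows whose DELETE predates the ancient history mark


def deleted_ratio(rs: RowSetStats) -> float:
    """Fraction of the rowset's rows that could be dropped by a compaction."""
    if rs.total_rows == 0:
        return 0.0
    return rs.ancient_deleted_rows / rs.total_rows


def pick_rowsets_for_delete_gc(rowsets: List[RowSetStats],
                               min_ratio: float = 0.1) -> List[RowSetStats]:
    """Return rowsets at or above the threshold, most-deleted first."""
    candidates = [rs for rs in rowsets if deleted_ratio(rs) >= min_ratio]
    return sorted(candidates, key=deleted_ratio, reverse=True)


rowsets = [
    RowSetStats("a", total_rows=1000, ancient_deleted_rows=1000),
    RowSetStats("b", total_rows=1000, ancient_deleted_rows=50),
    RowSetStats("c", total_rows=0, ancient_deleted_rows=0),
    RowSetStats("d", total_rows=1000, ancient_deleted_rows=500),
]
picked = [rs.name for rs in pick_rowsets_for_delete_gc(rowsets)]
```

In the fully deleted table from the description, every rowset would have a ratio of 1.0 and so would be selected. A real policy would presumably also weigh the I/O cost of rewriting each rowset against the space reclaimed.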
Issue Links
- is duplicated by
  - KUDU-1979 Row GC maintenance task (Resolved)
- is related to
  - KUDU-3367 Delta file with full of delete op can not be schedule to compact (Resolved)
  - KUDU-1400 Improve rowset compaction policy to consider merging small DRSs (Resolved)
- relates to
  - KUDU-3619 The 'supplement to GC algorithm' breaks major delta compaction (Resolved)