[CASSANDRA-14210] Optimize SSTables upgrade task scheduling - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 3.0.17, 3.11.3, 4.0-alpha1, 4.0
Component/s: Local/Compaction
Labels: None

Description

When the SSTable rewrite is started with nodetool upgradesstables --jobs N and N > 1, not all of the N provided slots stay in use.

For example, we tested with concurrent_compactors=5 and N=4. On both 2.2 and 3.0 we observed that all 4 provided slots are initially used for "Upgrade sstables" compactions, but when some of the 4 tasks finish, no new tasks are scheduled immediately. The next 4 tasks are scheduled only after the last of the current 4 has finished. This happens on every node we've observed.
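
The cost of this behavior can be illustrated with a toy model. The sketch below is not Cassandra code: the function names and the task durations are made up for illustration, and it only compares the total run time of batch-wise scheduling (as observed) with scheduling a new task as soon as a slot is free.

```python
import heapq

def makespan_batched(durations, jobs):
    """Total time when the next batch starts only after the whole previous batch ends."""
    total = 0
    for i in range(0, len(durations), jobs):
        # A batch takes as long as its slowest task.
        total += max(durations[i:i + jobs])
    return total

def makespan_refill(durations, jobs):
    """Total time when a waiting task starts as soon as any slot becomes free."""
    slots = [0] * jobs  # time at which each slot becomes free
    heapq.heapify(slots)
    for d in durations:
        free_at = heapq.heappop(slots)      # earliest free slot
        heapq.heappush(slots, free_at + d)  # slot is busy until the task ends
    return max(slots)

# Hypothetical task durations in arbitrary time units, with --jobs 4.
durations = [8, 1, 1, 1, 8, 1, 1, 1]
print(makespan_batched(durations, 4))  # 16
print(makespan_refill(durations, 4))   # 9
```

In this made-up case one long task per batch holds back three idle slots, so the batched run takes 16 units against 9 when slots are refilled immediately.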

This does not use the available resources to the extent allowed by the --jobs N parameter. In the field, on a 12-node cluster with 4-5 TiB of data per node, the whole process took more than 7 days instead of the estimated 1.5-2 days (an estimate that assumes close to full utilization of the N slots).

Instead, a new task should be scheduled as soon as a compaction slot becomes free.
Additionally, starting with the biggest SSTables could further reduce the total time the whole process takes on any given node.
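
Why the order of tasks can matter is easy to see in a small simulation. This is a sketch under stated assumptions, not the actual patch: `finish_time` is a hypothetical helper, the sizes are invented, and rewrite time is assumed to be proportional to SSTable size. Largest-first is a heuristic and is not guaranteed to give the optimal schedule.

```python
import heapq

def finish_time(sizes, jobs):
    """Time until all tasks finish when each task takes the earliest free slot."""
    slots = [0] * jobs  # time at which each slot becomes free
    heapq.heapify(slots)
    for size in sizes:
        # Assumption: the time to rewrite an SSTable is proportional to its size.
        heapq.heappush(slots, heapq.heappop(slots) + size)
    return max(slots)

# Hypothetical SSTable sizes in arbitrary units, with --jobs 2.
sizes = [1, 1, 1, 1, 8]
print(finish_time(sizes, 2))                        # 10
print(finish_time(sorted(sizes, reverse=True), 2))  # 8
```

When the big SSTable comes last, it starts late and runs alone while the other slot sits idle. Started first, it overlaps with all of the small rewrites.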

People

Assignee: Kurt Greaves
Reporter: Oleksandr Shulgin
Authors: Kurt Greaves
Reviewers: Marcus Eriksson
Votes: 0
Watchers: 6

Dates

Created: 01/Feb/18 14:36
Updated: 15/May/20 08:03
Resolved: 05/Mar/18 08:18