[CASSANDRA-2841] Always use even distribution for merkle tree with RandomPartitionner - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Low
Resolution: Fixed
Fix Version/s: 0.7.7, 0.8.2
Component/s: None
Labels:
- repair

Description

When creating the initial merkle tree, repair tries to be (too) smart and use the key samples to "guide" the tree splitting. While this is a good idea for OPP where there is a good change the data distribution is uneven, you can't beat an even distribution for the RandomPartitionner. And a quick experiment even shows that the method used is significantly less efficient than an even distribution for the ranges of the merkle tree (that is, an even distribution gives a much better of distribution of the number of keys by range of the tree).

Thus let's switch to an even distribution for RandomPartitionner. That 3 lines change alone amounts for a significant improvement of repair's precision.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

2841.patch
29/Jun/11 19:04
3 kB
Sylvain Lebresne

Activity

People

Assignee:: Sylvain Lebresne

Reporter:: Sylvain Lebresne

Authors:: Sylvain Lebresne

Reviewers:: Jonathan Ellis

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 29/Jun/11 19:03

Updated:: 16/Apr/19 09:32

Resolved:: 30/Jun/11 07:50