[CASSANDRA-1316] Read repair does not always work correctly - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 0.6.4
Component/s: None
Labels: None
Severity: Normal

Description

Read repair does not always work. At a minimum, we allow the CL.ALL contract to be violated. To reproduce, create a three-node cluster with RF=3 and use json2sstable to load one of the attached JSON files on each node. This creates a row whose key is 'test' with 9 columns in total, but only 3 of those columns are on each machine. If you call get_count on this row in quick succession at CL.ALL, you will sometimes receive a count of 6 and sometimes 9. After the ReadRepairManager has sent the repairs, you will always get 9, which is the desired behavior.
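
The reproduction above can be modeled with a toy sketch, which may help when reasoning about the 6-versus-9 counts. It is not Cassandra code: the `resolve` and `repairs_needed` helpers, the column names, and the timestamps are all hypothetical, and the idea that a count of 6 comes from merging only two of the three replica responses is an illustrative assumption rather than a confirmed diagnosis.

```python
from typing import Dict, List, Tuple

# Hypothetical model: each replica maps column name -> (value, timestamp).
Column = Tuple[str, int]
Replica = Dict[str, Column]


def make_replicas() -> List[Replica]:
    """Build 3 replicas of row 'test', each holding a disjoint set of 3 columns."""
    replicas: List[Replica] = []
    for node in range(3):
        first = node * 3
        replicas.append({f"col{i}": (f"v{i}", 1) for i in range(first, first + 3)})
    return replicas


def resolve(responses: List[Replica]) -> Replica:
    """Merge replica responses, keeping the newest timestamp for each column."""
    merged: Replica = {}
    for response in responses:
        for name, (value, ts) in response.items():
            if name not in merged or ts > merged[name][1]:
                merged[name] = (value, ts)
    return merged


def repairs_needed(merged: Replica, replica: Replica) -> Replica:
    """Columns that a replica is missing or holds in a stale version."""
    return {name: col for name, col in merged.items() if replica.get(name) != col}


replicas = make_replicas()

# Merging every response gives the full row, as a read at CL.ALL should.
count_all = len(resolve(replicas))

# Assumption for illustration: only two of the three responses get merged.
count_partial = len(resolve(replicas[:2]))

# Simulate the repairs being delivered to every replica.
full_row = resolve(replicas)
for replica in replicas:
    replica.update(repairs_needed(full_row, replica))

# Once repaired, even a single replica returns the whole row.
count_after_repair = len(resolve(replicas[:1]))

print(count_all, count_partial, count_after_repair)
```

In this model the full merge yields 9 columns, a merge of two responses yields 6, and every read yields 9 once the repairs have been applied, which matches the counts described in the reproduction.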

I have another data set, obtained in the wild, that never fully repairs for some reason, but it is a bit large to attach (roughly 600 columns per machine). I'm still trying to figure out why RR isn't working on this set, but I always get different results when reading at any CL, including ALL, no matter how long I wait or how many reads I do.

Attachments

- cassandra-1.json, 25/Jul/10 20:15, 0.1 kB, Brandon Williams
- cassandra-2.json, 25/Jul/10 20:15, 0.1 kB, Brandon Williams
- cassandra-3.json, 25/Jul/10 20:15, 0.1 kB, Brandon Williams
- 001_correct_responsecount_in_RRR.txt, 26/Jul/10 15:24, 1 kB, Brandon Williams
- RRR-v2.txt, 26/Jul/10 16:08, 3 kB, Jonathan Ellis
- 1316-RRM.txt, 27/Jul/10 02:41, 7 kB, Jonathan Ellis

People

Assignee: Brandon Williams

Reporter: Brandon Williams

Authors: Brandon Williams

Votes: 0

Watchers: 3

Dates

Created: 24/Jul/10 23:50

Updated: 16/Apr/19 09:33

Resolved: 27/Jul/10 16:47