Details
-
Improvement
-
Status: Resolved
-
Normal
-
Resolution: Fixed
-
None
Description
Today, if there’s a digest mismatch in a foreground read repair, the insert to update out of date replicas is blocking. This means, if it fails, the read fails with a timeout. If a node is dropping writes (maybe it is overloaded or the mutation stage is backed up for some other reason), all reads to a replica set could fail. Further, replicas dropping writes get more out of sync so will require more read repair.
The comment on the code for why the writes are blocking is:
// wait for the repair writes to be acknowledged, to minimize impact on any replica that's // behind on writes in case the out-of-sync row is read multiple times in quick succession
but the bad side effect is that reads timeout. Either the writes should not be blocking or we should return success for the read even if the write times out.
Attachments
Issue Links
- breaks
-
CASSANDRA-14740 BlockingReadRepair does not maintain monotonicity during range movements
- Resolved
- is duplicated by
-
CASSANDRA-14480 Digest mismatch requires all replicas to be responsive
- Resolved
- is related to
-
CASSANDRA-14635 Support table level configuration of monotonic reads
- Resolved
- relates to
-
CASSANDRA-13863 Speculative retry causes read repair even if read_repair_chance is 0.0.
- Open
- links to