Details
- Type: Improvement
- Status: Open
- Priority: Normal
- Resolution: Unresolved
Description
With CASSANDRA-5351 and CASSANDRA-2424 I think there is an opportunity to avoid a lot of extra disk I/O when running queries with higher consistency levels.
Since repaired data is by definition consistent and we know which sstables are repaired, we can optimize the read path by introducing a REPAIRED_QUORUM consistency level that breaks reads into two phases:
1) Read the result from the repaired sstables from a single replica.
2) Read only the unrepaired data from a quorum of replicas.
For the node performing phase 1, we can pipeline the call so it remains a single hop.
In the long run (assuming data is repaired regularly) we will end up with performance much closer to CL.ONE while maintaining consistency.
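To make the proposal concrete, here is a minimal coordinator-side sketch of the two-phase read in Java. Replica, Row, readRepaired, readUnrepaired, and resolveByTimestamp are hypothetical stand-ins for illustration, not existing Cassandra internals; a real implementation would also need to handle tombstones, digests, and read repair.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.CompletableFuture;

// Hypothetical sketch of the proposed REPAIRED_QUORUM read path.
public class RepairedQuorumRead {

    interface Replica {
        // Phase 1: result computed only from repaired (consistent) sstables.
        CompletableFuture<List<Row>> readRepaired(String key);
        // Phase 2: result computed only from unrepaired sstables.
        CompletableFuture<List<Row>> readUnrepaired(String key);
    }

    record Row(String name, String value, long timestamp) {}

    public CompletableFuture<List<Row>> read(String key, List<Replica> replicas) {
        int quorum = replicas.size() / 2 + 1;

        // Phase 1: one replica scans its repaired sstables. Repaired data is
        // consistent by definition, so a single response suffices.
        CompletableFuture<List<Row>> repaired = replicas.get(0).readRepaired(key);

        // Phase 2: a quorum of replicas scans only unrepaired sstables, where
        // divergence between replicas can still exist. Replica 0 participates
        // in both phases, so its two requests can be pipelined in one hop.
        List<CompletableFuture<List<Row>>> unrepaired = new ArrayList<>();
        for (int i = 0; i < quorum; i++) {
            unrepaired.add(replicas.get(i).readUnrepaired(key));
        }

        CompletableFuture<List<Row>> resolvedUnrepaired =
            CompletableFuture.allOf(unrepaired.toArray(new CompletableFuture[0]))
                .thenApply(v -> resolveByTimestamp(
                    unrepaired.stream().map(CompletableFuture::join).toList()));

        // Merge the single repaired result with the quorum-resolved unrepaired
        // result, again keeping the newest cell per column.
        return repaired.thenCombine(resolvedUnrepaired,
            (a, b) -> resolveByTimestamp(List.of(a, b)));
    }

    // Last-write-wins reconciliation: for each column name, keep the cell
    // with the highest timestamp across all responses.
    static List<Row> resolveByTimestamp(List<List<Row>> responses) {
        Map<String, Row> newest = new HashMap<>();
        for (List<Row> response : responses) {
            for (Row row : response) {
                newest.merge(row.name(), row,
                    (cur, inc) -> inc.timestamp() > cur.timestamp() ? inc : cur);
            }
        }
        return new ArrayList<>(newest.values());
    }
}
```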
Some things to figure out:
- If repairs fail on some nodes, we can end up without a consistent repaired state across the replicas.
Attachments
Issue Links
- is blocked by
  - CASSANDRA-5839 Save repair data to system table (Resolved)
- relates to
  - CASSANDRA-19007 Queries with multi-column replica-side filtering can miss rows (Open)
  - CASSANDRA-4914 Aggregation functions in CQL (Resolved)
  - CASSANDRA-5791 A nodetool command to validate all sstables in a node (Resolved)
- requires
  - CASSANDRA-9111 SSTables originated from the same incremental repair session have different repairedAt timestamps (Resolved)