Details
-
Improvement
-
Status: Resolved
-
Low
-
Resolution: Fixed
-
None
Description
Many people have been reporting 'repair hang' when something goes wrong.
Two major causes of hang are 1) validation failure and 2) streaming failure.
Currently, when those failures happen, the failed node would not respond back to the repair initiator.
The goal of this ticket is to redesign message flows around repair so that repair never hang.
Attachments
Attachments
Issue Links
- is depended upon by
-
CASSANDRA-5393 Add retry mechanism to OTC for non-droppable_verbs
- Resolved
- relates to
-
CASSANDRA-3112 Make repair fail when an unexpected error occurs
- Resolved
-
CASSANDRA-11190 Fail fast repairs
- Open