[CASSANDRA-14363] system_distributed.repair_history is not properly updated if parent dies - ASF JIRA

Details

Type: Bug
Status: Open
Priority: Low
Resolution: Unresolved
Fix Version/s: 5.x
Component/s: Consistency/Repair
Labels: None
Severity: Low

Description

When a repair starts on a node, its details are written to system_distributed.repair_history. If that node is the parent (the one holding the repair session) and it dies, the entries for the in-flight repair stay in the "STARTED" state and are never updated.

To resolve this, the node should check on startup whether it was the parent of any repair before the crash or restart. If it finds such entries in the table (and in system_distributed.parent_repair_history), it should mark them as FAILED.
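
The proposed startup check could look roughly like the following. This is only an in-memory sketch in Python, not Cassandra code: the `RepairEntry` record, its field names, and the function `mark_orphaned_repairs_failed` are hypothetical and simplified, and a real fix would read from and write to the system tables instead of a list.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class RepairEntry:
    """Hypothetical, simplified stand-in for one repair history row."""
    session_id: str
    coordinator: str  # assumed: address of the parent node for this repair
    status: str       # "STARTED", "SUCCESS", or "FAILED"


def mark_orphaned_repairs_failed(entries: List[RepairEntry],
                                 local_node: str) -> List[RepairEntry]:
    """Return a copy of entries where repairs that this node was
    coordinating and that are still STARTED are marked FAILED.

    Entries owned by other nodes, or already finished, are left untouched.
    """
    return [
        replace(e, status="FAILED")
        if e.coordinator == local_node and e.status == "STARTED"
        else e
        for e in entries
    ]


# Example: node 10.0.0.1 restarts after dying mid-repair.
history = [
    RepairEntry("s1", "10.0.0.1", "STARTED"),  # orphaned by the crash
    RepairEntry("s2", "10.0.0.1", "SUCCESS"),  # finished before the crash
    RepairEntry("s3", "10.0.0.2", "STARTED"),  # owned by another, live node
]
cleaned = mark_orphaned_repairs_failed(history, "10.0.0.1")
```

Filtering on both the coordinator and the STARTED status matters here: a restarting node must not touch repairs that another node is still running.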

People

Assignee: Unassigned

Reporter: Alex Lourie

Votes: 0

Watchers: 4

Dates

Created: 04/Apr/18 04:02

Updated: 07/Mar/23 10:54