Details
-
Bug
-
Status: Resolved
-
Normal
-
Resolution: Not A Problem
-
None
-
None
-
Normal
Description
Set up a test cluster with 2 DC's, heavy use of LCS (not sure if that's relevant).
Kick off cassandra-stress against it.
Kick of an automated incremental repair cycle.
After a bit a node starts flapping which causes a few repairs to fail. This is never cleared out of pending repairs - given the keyspace is replicated to all nodes it means they all have pending repairs that will never complete. Repairs are basically blocked at this point.
Given we're using Incremental repairs you're now spammed with:
"Cannot start multiple repair sessions over the same sstables"
Cluster and logs are still available for review - message me for details.