Details
- Improvement
- Status: Resolved
- Normal
- Resolution: Fixed
Description
A local node can be paused for many reasons, such as GC. If the pause is long enough, then when the node recovers it will think all the other nodes are dead until it gossips again, causing UnavailableException (UAE) to be thrown to clients that try to use it as a coordinator. Instead, the failure detector (FD) can track the current time, and if the gap since its last check becomes too large, skip marking the nodes down (perhaps resetting the FD data).
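The proposal above can be illustrated with a minimal sketch. This is not the patch that resolved the issue: the class `PauseAwareDetector`, its methods, and both thresholds are hypothetical names chosen for illustration, and a fixed timeout stands in for whatever conviction logic the real failure detector uses. The only point shown is the pause check: if the time between two consecutive checks exceeds a limit, the local node assumes it was paused, resets its per-peer data, and marks nothing down on that pass.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of a failure detector that ignores local pauses.
// Timestamps are passed in explicitly (in nanoseconds) so the behavior is easy to test.
class PauseAwareDetector {
    private final long maxLocalPauseNanos; // assumed threshold for "the local node was paused"
    private final long peerTimeoutNanos;   // assumed simple timeout before a peer is marked down
    private final Map<String, Long> lastHeard = new HashMap<>();
    private final Map<String, Boolean> alive = new HashMap<>();
    private long lastCheckNanos;

    PauseAwareDetector(long maxLocalPauseNanos, long peerTimeoutNanos, long nowNanos) {
        this.maxLocalPauseNanos = maxLocalPauseNanos;
        this.peerTimeoutNanos = peerTimeoutNanos;
        this.lastCheckNanos = nowNanos;
    }

    // Record a heartbeat (for example, a gossip message) from a peer.
    void report(String peer, long nowNanos) {
        lastHeard.put(peer, nowNanos);
        alive.put(peer, true);
    }

    // Periodic check. Returns true if the pass was skipped because a local pause was detected.
    boolean check(long nowNanos) {
        long gap = nowNanos - lastCheckNanos;
        lastCheckNanos = nowNanos;
        if (gap > maxLocalPauseNanos) {
            // The local node was likely paused: reset the per-peer data instead of
            // blaming the peers for the silence.
            for (Map.Entry<String, Long> e : lastHeard.entrySet()) {
                e.setValue(nowNanos);
            }
            return true;
        }
        for (Map.Entry<String, Long> e : lastHeard.entrySet()) {
            if (nowNanos - e.getValue() > peerTimeoutNanos) {
                alive.put(e.getKey(), false);
            }
        }
        return false;
    }

    boolean isAlive(String peer) {
        return alive.getOrDefault(peer, false);
    }
}
```

In this sketch a peer that was alive before a long local pause stays alive after it, and is only marked down if it remains silent for longer than the timeout once checks resume at their normal cadence.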
Attachments
Issue Links
- is duplicated by
  - CASSANDRA-9218 Node thinks other nodes are down after heavy GC (Resolved)
- is related to
  - CASSANDRA-9446 Failure detector should ignore local pauses per endpoint (Resolved)