Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-9183

Failure detector should detect and ignore local pauses

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Normal
    • Resolution: Fixed
    • Fix Version/s: 2.1.6, 2.2.0 beta 1
    • Component/s: None
    • Labels:
      None

      Description

      A local node can be paused for many reasons such as GC, and if the pause is long enough when it recovers it will think all the other nodes are dead until it gossips, causing UAE to be thrown to clients trying to use it as a coordinator. Instead, the FD can track the current time, and if the gap there becomes too large, skip marking the nodes down (reset the FD data perhaps)

        Attachments

        1. 9183.txt
          2 kB
          Brandon Williams
        2. 9183-v2.txt
          2 kB
          Brandon Williams

          Issue Links

            Activity

              People

              • Assignee:
                brandon.williams Brandon Williams
                Reporter:
                brandon.williams Brandon Williams
                Authors:
                Brandon Williams
                Reviewers:
                Richard Low
              • Votes:
                0 Vote for this issue
                Watchers:
                7 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: