Apache Cassandra / CASSANDRA-515

Gossiper misses first updates when restarting a node

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Low
    • Resolution: Fixed
    • Fix Version/s: 0.5
    • Component/s: None
    • Labels: None
    • Severity: Low

    Description

      Easy way to reproduce:

      Start node A.
      Start node B, with autobootstrap=false.
      Kill B, wipe its data dir, and restart it (still with autobootstrap=false).

      A will show B as down, with its old token. (B will see both nodes correctly.)

      This appears to happen because wiping the data dir resets the generation to 1. (This is not just operator error: apart from testing, it can also arise when a node dies completely and has to be replaced.) A then ignores B's gossip state until B's new heartbeat grows larger than the one it had previously reached.

      It appears that initializing the generation to seconds-since-epoch would fix this.
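
      The failure and the proposed fix can be illustrated with a minimal sketch. It is not the code from 515.patch: the HeartBeat class, isNewer, and generationFromClock are hypothetical names, and the comparison rule (generation first, then heartbeat version) is an assumption based on the description above. The generation is kept as an int, which only holds seconds-since-epoch until 2038.

```java
import java.time.Instant;

public class GenerationSketch {
    /** Hypothetical stand-in for a node's gossip heartbeat: (generation, version). */
    public static final class HeartBeat {
        public final int generation;
        public final int version;

        public HeartBeat(int generation, int version) {
            this.generation = generation;
            this.version = version;
        }
    }

    /**
     * Assumed acceptance rule: a higher generation always wins; within the
     * same generation, only a higher heartbeat version is accepted.
     */
    public static boolean isNewer(HeartBeat known, HeartBeat incoming) {
        if (incoming.generation != known.generation) {
            return incoming.generation > known.generation;
        }
        return incoming.version > known.version;
    }

    /** Proposed fix: seed the generation from the wall clock (seconds since epoch). */
    public static int generationFromClock(Instant now) {
        return (int) now.getEpochSecond();
    }

    public static void main(String[] args) {
        // What A remembers about B from before B was killed.
        HeartBeat known = new HeartBeat(1, 500);

        // B after the data dir wipe: generation is back at 1, heartbeat restarts low.
        HeartBeat afterWipe = new HeartBeat(1, 3);
        System.out.println(isNewer(known, afterWipe)); // false: A ignores B

        // Same restart, but with a clock-based generation.
        HeartBeat fixed = new HeartBeat(generationFromClock(Instant.now()), 3);
        System.out.println(isNewer(known, fixed)); // true: A accepts B immediately
    }
}
```

      With a clock-based generation, a restarted node outranks its earlier incarnation even when no on-disk state survives, provided its clock has not moved backwards.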

      Attachments

        1. 515.patch
          4 kB
          Jonathan Ellis

          People

            jbellis Jonathan Ellis