Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-16213

Cannot replace_address /X because it doesn't exist in gossip

    XMLWordPrintableJSON

Details

    Description

      We see this exception around nodes crashing and trying to do a host replacement; this error appears to be correlated around multiple node failures.

      A simplified case to trigger this is the following

      *) Have a N node cluster
      *) Shutdown all N nodes
      *) Bring up N-1 nodes (at least 1 seed, else replace seed)
      *) Host replace the N-1th node -> this will fail with the above

      The reason this happens is that the N-1th node isn’t gossiping anymore, and the existing nodes do not have its details in gossip (but have the details in the peers table), so the host replacement fails as the node isn’t known in gossip.

      This affects all versions (tested 3.0 and trunk, assume 2.2 as well)

      Attachments

        Issue Links

          Activity

            People

              dcapwell David Capwell
              dcapwell David Capwell
              David Capwell
              Brandon Williams, Jon Meredith, Paulo Motta, Sam Tunnicliffe
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: