[CASSANDRA-8138] replace_address cannot find node to be replaced after seed node restart

Details

Type: Bug
Status: Resolved
Priority: Normal
Resolution: Won't Fix
Fix Version/s: None
Component/s: None
Labels: None
Severity: Normal
Since Version: 2.0.0

Description

If a node fails and the cluster is then restarted (a common case during massive outages), replace_address fails with:

Caused by: java.lang.RuntimeException: Cannot replace_address /172.19.56.97 because it doesn't exist in gossip
jvm 1    | 	at org.apache.cassandra.service.StorageService.prepareReplacementInfo(StorageService.java:472)
jvm 1    | 	at org.apache.cassandra.service.StorageService.joinTokenRing(StorageService.java:724)
jvm 1    | 	at org.apache.cassandra.service.StorageService.initServer(StorageService.java:686)
jvm 1    | 	at org.apache.cassandra.service.StorageService.initServer(StorageService.java:562)

Although the necessary information is saved in the system tables on the seed nodes, it is not loaded into gossip when a seed node starts, so a replacement node cannot obtain it.

The attached patch loads all of this information from the system tables into gossip with generation 0, and fixes some bugs in how this information is handled during the shadow gossip round.
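The idea of seeding gossip from saved state can be illustrated with a small, self-contained sketch. This is not the attached patch and does not use Cassandra's real classes: `GossipReplaySketch`, `EndpointState`, `loadSavedEndpoints`, `onLiveGossip`, and the token string `"token-a"` are all hypothetical names chosen for illustration. It assumes only what the description states: saved peer information is inserted into gossip with generation 0, so that a lookup for a dead node succeeds, while any live node with a higher generation still takes precedence.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Toy model only: hypothetical names, not Cassandra's Gossiper or StorageService.
public class GossipReplaySketch {

    /** Minimal stand-in for a gossip entry: a generation plus the node's tokens. */
    public static final class EndpointState {
        final int generation;
        final String tokens;

        EndpointState(int generation, String tokens) {
            this.generation = generation;
            this.tokens = tokens;
        }
    }

    private final Map<String, EndpointState> gossip = new HashMap<>();

    /** On startup, copy persisted peer info into gossip as generation-0 placeholders. */
    public void loadSavedEndpoints(Map<String, String> savedPeers) {
        for (Map.Entry<String, String> peer : savedPeers.entrySet()) {
            // putIfAbsent: never overwrite state already learned from a live node.
            gossip.putIfAbsent(peer.getKey(), new EndpointState(0, peer.getValue()));
        }
    }

    /** State received from a live node replaces any entry with a lower generation. */
    public void onLiveGossip(String address, int generation, String tokens) {
        gossip.merge(address, new EndpointState(generation, tokens),
                (old, incoming) -> incoming.generation > old.generation ? incoming : old);
    }

    /** What a replacement node needs: the tokens of the node it is replacing. */
    public Optional<String> tokensForReplacement(String address) {
        return Optional.ofNullable(gossip.get(address)).map(state -> state.tokens);
    }

    public static void main(String[] args) {
        GossipReplaySketch seed = new GossipReplaySketch();
        String dead = "172.19.56.97";

        // After a restart the seed knows nothing about the dead node.
        System.out.println("before load: " + seed.tokensForReplacement(dead).isPresent());

        // Hypothetical contents of the saved system table.
        Map<String, String> saved = new HashMap<>();
        saved.put(dead, "token-a");
        seed.loadSavedEndpoints(saved);

        System.out.println("after load: " + seed.tokensForReplacement(dead).isPresent());
    }
}
```

Using generation 0 for the placeholder is what keeps the sketch safe: any node that is actually alive gossips a higher generation, so stale saved state never wins over live state.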

Attachments

ReplaceAfterSeedRestart.txt, 8 kB, attached 18/Oct/14 08:22 by Oleg Anastasyev

People

Assignee: Brandon Williams

Reporter: Oleg Anastasyev

Authors: Brandon Williams

Votes: 0

Watchers: 2

Dates

Created: 18/Oct/14 08:22

Updated: 16/Apr/19 09:31

Resolved: 10/Nov/14 22:15