[CASSANDRA-4288] prevent thrift server from starting before gossip has settled - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 2.0.5
Component/s: None
Labels: None

Description

A serious problem is that there is no coordination whatsoever between gossip and the consumers of gossip. In particular, on a large cluster with hundreds of nodes, it takes several seconds for gossip to settle because the gossip stage is CPU bound. As a result, a node starts up and accepts thrift traffic long before it has any clue which nodes are up and which are down. This leads to client-visible timeouts (for nodes that are down but not yet identified as such) and UnavailableException (for nodes that are up but not yet identified as such). This is really bad in general, but in particular for clients doing non-idempotent writes (counter increments).

I was going to fix this as part of more significant rewriting in other tickets having to do with gossip/topology/etc., but that's not going to happen. So the attached patch is roughly what we're running in production now to make restarts bearable. The minimum wait time serves two purposes: it gives gossip time to start becoming CPU bound if it is going to, and it is large enough to allow down nodes to be identified as such in most typical cases with a default phi conviction threshold. The value is untested: we actually ran with a smaller minimum of 5 seconds, but from past experience I believe 15 seconds is enough.

The patch is tested on our 1.1 branch. It applies on trunk, and the diff is against trunk, but I have not tested it against trunk.
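For readers who want a concrete picture of the wait described above, here is a minimal sketch. It is not the code from the attached patch. The class name `GossipSettleWaiter`, the method `waitUntilSettled`, all of its parameters, and the idea of polling a pending-task count for the gossip stage are assumptions made for illustration only.

```java
import java.util.function.IntSupplier;

/** Hypothetical sketch of a "wait for gossip to settle" gate; not the attached patch. */
final class GossipSettleWaiter {
    private GossipSettleWaiter() {}

    /**
     * Blocks until at least minWaitMs has elapsed AND the supplied pending-task
     * count has been zero for requiredQuietPolls consecutive polls.
     * Gives up once maxWaitMs has elapsed, or if the thread is interrupted.
     *
     * @param pendingGossipTasks assumed source of the gossip stage backlog size
     * @return true if gossip looked settled, false on timeout or interruption
     */
    static boolean waitUntilSettled(IntSupplier pendingGossipTasks,
                                    long minWaitMs,
                                    long pollIntervalMs,
                                    int requiredQuietPolls,
                                    long maxWaitMs) {
        long start = System.nanoTime();
        int quietPolls = 0;
        while (true) {
            long elapsedMs = (System.nanoTime() - start) / 1_000_000L;
            if (elapsedMs >= minWaitMs && quietPolls >= requiredQuietPolls) {
                return true;   // minimum wait satisfied and no backlog seen recently
            }
            if (elapsedMs >= maxWaitMs) {
                return false;  // caller decides whether to start the server anyway
            }
            try {
                Thread.sleep(pollIntervalMs);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return false;
            }
            // Any backlog resets the streak of quiet polls.
            quietPolls = (pendingGossipTasks.getAsInt() == 0) ? quietPolls + 1 : 0;
        }
    }
}
```

A caller would run this before starting the thrift server, for example with a 15 second minimum wait as suggested in the description, and choose its own policy for the timeout case.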

Attachments

CASSANDRA-4288-trunk.txt
25/May/12 19:57
2 kB
Peter Schuller
j4288-1.2-v1-txt
06/Dec/13 21:00
3 kB
Chris Burroughs
j4288-1.2-v2-txt
13/Dec/13 19:00
3 kB
Chris Burroughs
j4288-1.2-v3.txt
07/Jan/14 14:49
3 kB
Chris Burroughs

Issue Links

is duplicated by

CASSANDRA-6334 Option to not listen on Thrift/CQL if cluster is <N nodes

Resolved

is related to

CASSANDRA-6127 vnodes don't scale to hundreds of nodes

Resolved

CASSANDRA-9401 Better check for gossip stabilization on startup

Resolved

Activity

People

Assignee: Chris Burroughs

Reporter: Peter Schuller

Authors: Chris Burroughs

Reviewers: Tom Hobbs

Votes: 0

Watchers: 9

Dates

Created: 25/May/12 19:56

Updated: 16/Apr/19 09:32

Resolved: 09/Jan/14 02:55