[CASSANDRA-6364] There should be different disk_failure_policies for data and commit volumes or commit volume failure should always cause node exit - ASF JIRA

Agile Board

Attach files

Attach Screenshot

Bulk Copy Attachments

Bulk Move Attachments

Voters

Watch issue

Watchers

Create sub-task

Convert to sub-task

Move

Link

Clone

Labels

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 2.0.6
Component/s: None
Labels:
None
Environment:

JBOD, single dedicated commit disk

Description

We're doing fault testing on a pre-production Cassandra cluster. One of the tests was to simulation failure of the commit volume/disk, which in our case is on a dedicated disk. We expected failure of the commit volume to be handled somehow, but what we found was that no action was taken by Cassandra when the commit volume fail. We simulated this simply by pulling the physical disk that backed the commit volume, which resulted in filesystem I/O errors on the mount point.

What then happened was that the Cassandra Heap filled up to the point that it was spending 90% of its time doing garbage collection. No errors were logged in regards to the failed commit volume. Gossip on other nodes in the cluster eventually flagged the node as down. Gossip on the local node showed itself as up, and all other nodes as down.

The most serious problem was that connections to the coordinator on this node became very slow due to the on-going GC, as I assume uncommitted writes piled up on the JVM heap. What we believe should have happened is that Cassandra should have caught the I/O error and exited with a useful log message, or otherwise done some sort of useful cleanup. Otherwise the node goes into a sort of Zombie state, spending most of its time in GC, and thus slowing down any transactions that happen to use the coordinator on said node.

A limit on in-memory, unflushed writes before refusing requests may also work. Point being, something should be done to handle the commit volume dying as doing nothing results in affecting the entire cluster. I should note, we are using: disk_failure_policy: best_effort

Attachments

tmp-2.0.patch
11/Feb/14 09:53
16 kB
Benedict Elliott Smith

Issue Links

Add Link

is related to

CASSANDRA-9749 CommitLogReplayer continues startup after encountering errors

Resolved

Delete this link

requires

CASSANDRA-6652 Stop CommitLogSegment.close() from unnecessarily calling sync() prior to cleaning the buffer

Resolved

Delete this link

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: Benedict Elliott Smith Assign to me

Reporter:: J. Ryan Earl

Authors:: Benedict Elliott Smith

Reviewers:: Marcus Eriksson

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 16/Nov/13 14:40

Updated:: 16/Apr/19 09:32

Resolved:: 13/Feb/14 21:59

Agile

View on Board

There should be different disk_failure_policies for data and commit volumes or commit volume failure should always cause node exit

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

Agile

Slack

Issue deployment