[CASSANDRA-5932] Speculative read performance data show unexpected results - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 2.0.2
Component/s: None
Labels:
None

Severity:
Normal

Description

I've done a series of stress tests with eager retries enabled that show undesirable behavior. I'm grouping these behaviours into one ticket as they are most likely related.

1) Killing off a node in a 4 node cluster actually increases performance.
2) Compactions make nodes slow, even after the compaction is done.
3) Eager Reads tend to lessen the immediate performance impact of a node going down, but not consistently.

My Environment:
1 stress machine: node0
4 C* nodes: node4, node5, node6, node7

My script:
node0 writes some data: stress -d node4 -F 30000000 -n 30000000 -i 5 -l 2 -K 20
node0 reads some data: stress -d node4 -n 30000000 -o read -i 5 -K 20

Examples:

A node going down increases performance:

Data for this test here

At 450s, I kill -9 one of the nodes. There is a brief decrease in performance as the snitch adapts, but then it recovers... to even higher performance than before.

Compactions make nodes permanently slow:

The green and orange lines represent trials with eager retry enabled, they never recover their op-rate from before the compaction as the red and blue lines do.

Data for this test here

Speculative Read tends to lessen the immediate impact:

This graph looked the most promising to me, the two trials with eager retry, the green and orange line, at 450s showed the smallest dip in performance.

Data for this test here

But not always:

This is a retrial with the same settings as above, yet the 95percentile eager retry (red line) did poorly this time at 450s.

Data for this test here

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

5932.6692c50412ef7d.compaction.png
01/Oct/13 15:14
66 kB
Ryan McGuire
5932.6692c50412ef7d.rr0.png
01/Oct/13 20:15
99 kB
Ryan McGuire
5932.6692c50412ef7d.rr1.png
02/Oct/13 15:49
100 kB
Ryan McGuire
5932.ded39c7e1c2fa.logs.tar.gz
29/Sep/13 16:16
536 kB
Ryan McGuire
5932.txt
25/Sep/13 01:36
23 kB
Aleksey Yeschenko
5932-6692c50412ef7d.png
29/Sep/13 21:27
76 kB
Ryan McGuire
5933-128_and_200rc1.png
27/Sep/13 14:55
77 kB
Ryan McGuire
5933-7a87fc11.png
27/Sep/13 14:55
83 kB
Ryan McGuire
5933-logs.tar.gz
27/Sep/13 14:55
565 kB
Ryan McGuire
5933-randomized-dsnitch-replica.2.png
27/Sep/13 18:42
79 kB
Ryan McGuire
5933-randomized-dsnitch-replica.3.png
27/Sep/13 21:13
68 kB
Ryan McGuire
5933-randomized-dsnitch-replica.png
27/Sep/13 17:14
67 kB
Ryan McGuire
compaction-makes-slow.png
24/Aug/13 00:51
50 kB
Ryan McGuire
compaction-makes-slow-stats.png
24/Aug/13 01:04
32 kB
Ryan McGuire
eager-read-looks-promising.png
24/Aug/13 00:51
53 kB
Ryan McGuire
eager-read-looks-promising-stats.png
24/Aug/13 01:04
31 kB
Ryan McGuire
eager-read-not-consistent.png
24/Aug/13 00:51
61 kB
Ryan McGuire
eager-read-not-consistent-stats.png
24/Aug/13 01:04
31 kB
Ryan McGuire
node-down-increase-performance.png
24/Aug/13 00:51
32 kB
Ryan McGuire

Issue Links

is related to

CASSANDRA-4705 Speculative execution for reads / eager retries

Resolved

Activity

People

Assignee:: Aleksey Yeschenko

Reporter:: Ryan McGuire

Authors:: Aleksey Yeschenko

Reviewers:: Jonathan Ellis

Votes:: 1 Vote for this issue

Watchers:: 13 Start watching this issue

Dates

Created:: 24/Aug/13 00:51

Updated:: 16/Apr/19 09:32

Resolved:: 26/Sep/13 20:55