Details
- Type: Improvement
- Status: Resolved
- Priority: Normal
- Resolution: Fixed
- None
Description
Too low a chunk size may waste some disk space. Too high a chunk size may lead to massive overreads and can have a critical impact on overall system performance.
In my case, the default chunk size led to peak read IO of up to 1 GB/s and average reads of 200 MB/s. After lowering the chunk size (aligned with read-ahead, of course), average read IO dropped below 20 MB/s, typically 10-15 MB/s.
The risk of (physical) overreads increases as the (page cache size) / (total data size) ratio decreases.
High chunk sizes are mostly appropriate for larger payloads per request, but if the data model consists mostly of small rows or small result sets, the read overhead with a 64 KB chunk size is extremely high. This applies, for example, to (small) skinny rows. A sketch of how the chunk size can be adjusted is shown below.
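For illustration, a minimal sketch of lowering the compression chunk size on an existing table (the keyspace/table names and the 4 KB value are hypothetical examples, not values taken from this ticket):

-- Hypothetical example: lower the compression chunk size of an existing table to 4 KB.
ALTER TABLE my_keyspace.my_table
    WITH compression = {'class': 'LZ4Compressor', 'chunk_length_in_kb': 4};

-- SSTables already on disk keep their old chunk size until they are rewritten,
-- e.g. with: nodetool upgradesstables -a my_keyspace my_table

As noted above, the block device read-ahead should be aligned with the chosen chunk size (e.g. via blockdev --setra), otherwise the kernel may still read far more than one chunk per request.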
Please also see here:
https://groups.google.com/forum/#!topic/scylladb-dev/j_qXSP-6-gY
To give you some insight into what a difference it can make (460 GB of data, 128 GB RAM):
- Latency of a quite large CF: https://cl.ly/1r3e0W0S393L
- Disk throughput: https://cl.ly/2a0Z250S1M3c
- This shows that the request distribution remained the same, so no "dynamic snitch magic": https://cl.ly/3E0t1T1z2c0J
Issue Links
- is related to: CASSANDRA-14857 Use a more space efficient representation for compressed chunk offsets (Open)