[CASSANDRA-19949] Count performance regression in Cassandra 4.x - ASF JIRA

Details

Type: Bug
Status: Triage Needed
Priority: Normal
Resolution: Unresolved
Fix Version/s: 4.0.x, 4.1.x
Component/s: None
Labels: None
Platform: All
Impacts: None

Description

Cassandra 4 exhibits a severe performance drop on count operations.

We created a reproduction workflow that inserts 100k rows, each holding a 10 kB random string.
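A minimal sketch of how such a data set could be generated is shown below. It is not the code from the attached script: the names `random_payload` and `generate_rows` are hypothetical, as is the choice of a UUID key per row. Only the row count and the 10,000-character payload size come from the report.

```python
import random
import string
import uuid

ROW_COUNT = 100_000     # rows inserted, per the description
PAYLOAD_SIZE = 10_000   # characters per row, matching "Average row size: 10000.0"


def random_payload(size=PAYLOAD_SIZE, rng=random):
    """Return a random alphanumeric string of exactly `size` characters."""
    alphabet = string.ascii_letters + string.digits
    return "".join(rng.choices(alphabet, k=size))


def generate_rows(count=ROW_COUNT, size=PAYLOAD_SIZE, seed=None):
    """Yield (key, payload) pairs; the UUID key is an assumption of this sketch."""
    rng = random.Random(seed)
    for _ in range(count):
        key = uuid.UUID(int=rng.getrandbits(128), version=4)
        yield key, random_payload(size, rng)
```

Each yielded pair could then be bound to a prepared INSERT statement; generating the payloads is the slow part of the workload.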

After this data is inserted into a 3-node cluster at RF 3 and queried at LOCAL_QUORUM (LQ), a count on that table takes:

- circa 2s on 3.11
- consistently more than 10s on 4.0 and 4.1 (around 12 to 13s); tested on 4.0.10 and 4.1.5

Output of the same program and query against each environment:

3.11

# COUNT #
61a5bcb0-75ca-11ef-9cff-55d571fe1347
Row count:100000
Count timing with fetch 5000: 0:00:01.846531
Average row size: 10000.0

4.1

# COUNT #
55d79f60-75cb-11ef-a8be-399c3e257132
Row count:100000
Count timing with fetch 5000: 0:00:13.408626
Average row size: 10000.0
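As a quick check on the two timings reported above, the slowdown factor can be computed directly from the printed values. The helper name `parse_timing` is hypothetical and assumes the `H:MM:SS.ffffff` format shown in the output.

```python
from datetime import timedelta


def parse_timing(text):
    """Parse a duration printed as H:MM:SS.ffffff into a timedelta."""
    hours, minutes, seconds = text.split(":")
    return timedelta(hours=int(hours), minutes=int(minutes), seconds=float(seconds))


t_311 = parse_timing("0:00:01.846531")  # 3.11 count timing from the report
t_41 = parse_timing("0:00:13.408626")   # 4.1 count timing from the report

slowdown = t_41.total_seconds() / t_311.total_seconds()
print(f"4.1 is {slowdown:.2f}x slower than 3.11")  # prints "4.1 is 7.26x slower than 3.11"
```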

The UUID shown in the output above is the trace ID of the query execution. The trace is then exported from each cluster with the command below, which produces the cassXXtrace.txt file:

cqlsh -e "show session [trace_id]" | tee cassXXtrace.txt
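For reference, a traced and timed count can be issued from the Python driver roughly as follows. This is a sketch under assumptions, not the attached script: `build_count_query` and `timed_count` are hypothetical names, the use of `SELECT COUNT(*)` is assumed, and the 60-second client timeout is an arbitrary value chosen because the slow case runs longer than the driver's 10-second default.

```python
import time
from datetime import timedelta


def build_count_query(keyspace, table):
    """Build the count statement (assumed form; the attached script may differ)."""
    return f"SELECT COUNT(*) FROM {keyspace}.{table}"


def timed_count(session, keyspace, table, fetch_size=5000, timeout=60.0):
    """Run a traced count at LOCAL_QUORUM; return (row_count, elapsed, trace_id)."""
    # Imported lazily so build_count_query stays usable without the driver installed.
    from cassandra import ConsistencyLevel
    from cassandra.query import SimpleStatement

    statement = SimpleStatement(
        build_count_query(keyspace, table),
        fetch_size=fetch_size,
        consistency_level=ConsistencyLevel.LOCAL_QUORUM,
    )
    start = time.perf_counter()
    result = session.execute(statement, timeout=timeout, trace=True)
    row_count = result.one()[0]
    elapsed = timedelta(seconds=time.perf_counter() - start)
    # The trace ID is the value to pass to "show session" in cqlsh.
    trace_id = result.get_query_trace().trace_id
    return row_count, elapsed, trace_id
```

Here `session` would come from `cassandra.cluster.Cluster([ip]).connect()`, with the node address, keyspace, and table supplied by the caller.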

Attached are cass311trace.txt and cass41trace.txt, which show the events associated with the queries above.

Note that the issue is far more pronounced in a 3-node cluster (I also tested on a single node in Docker, where it is less visible).

Attaching objcount.py, which contains 2 functions to insert and read the data. The insert is pretty slow because it generates random junk 10k objects, but it allows the issue to be reproduced. Just uncomment the gateway_insert call to trigger the data insert.

    # gateway_insert(session, ks, tbl)
    gateway_query(session, ks, tbl, fetch)

Requires argparse and the Cassandra Python driver.
To use, run the following command. Consider uncommenting lines 40 and 41 for keyspace/table creation and line 155 for the insert workload.

python3 ./objcount.py -i <ip> -k <ks> -t <table>

Attachments

- cass311count.txt, 24/Sep/24 13:37, 0.2 kB, Romain Anselin
- cass311debugcount.txt, 24/Sep/24 13:37, 27 kB, Romain Anselin
- cass311trace.txt, 24/Sep/24 13:37, 89 kB, Romain Anselin
- cass400trace.txt, 26/Sep/24 15:34, 99 kB, Romain Anselin
- cass41count.txt, 24/Sep/24 13:37, 0.2 kB, Romain Anselin
- cass41debugcount.txt, 24/Sep/24 13:37, 32 kB, Romain Anselin
- cass41trace.txt, 24/Sep/24 13:37, 99 kB, Romain Anselin
- objcount-1.py, 24/Sep/24 13:45, 6 kB, Romain Anselin

People

Assignee: Unassigned
Reporter: Romain Anselin
Votes: 0
Watchers: 6

Dates

Created: 24/Sep/24 13:47
Updated: 24/Oct/24 16:42