[CASSANDRA-9914] Millions of fake pending compaction tasks + high CPU - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Normal
Resolution: Duplicate
Fix Version/s: None
Component/s: None
Labels:
None
Environment:

CentOS

Severity:
Normal
Since Version:

2.1.7

Description

We have a 3-node test cluster (initially running 2.1.8) with zero traffic and about 10GB of data on each node. It's showing millions of pending compaction tasks (but no actual work in progress), and the CPUs are pegged on all three nodes. The task count goes down rapidly, but then jumps back up again seconds later. All tables are set to STCS. The issue persists after restart, but takes a few minutes before it becomes a problem. SSTable counts are below 10 for every table. We're also seeing 20s Old Gen GC pauses about every 2-3 mins.

This started happening after bulk loading some old data. We started seeing very long GC pauses (sometimes 30 min or more) that would bring down the nodes. We then truncated this table, which resulted in the current behavior. We attempted to roll back our cluster to 2.1.7 patched with ~~CASSANDRA-9637~~, but we observed the same behavior.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

high_pending_compactions.txt
28/Jul/15 19:42
3 kB
Robert Strickland
cass_high_cpu.png
28/Jul/15 19:44
212 kB
Robert Strickland

Issue Links

duplicates

CASSANDRA-9662 compactionManager reporting wrong pendingtasks

Resolved

Activity

People

Assignee:: Marcus Eriksson

Reporter:: Robert Strickland

Authors:: Marcus Eriksson

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 28/Jul/15 19:37

Updated:: 16/Apr/19 09:31

Resolved:: 29/Jul/15 15:30