CASSANDRA-6285

2.0 HSHA server introduces corrupt data

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Fix Version/s: 2.0.8
    • Component/s: Core
    • Labels:
      None
    • Environment:

      4 nodes, shortly updated from 1.2.11 to 2.0.2

      Description

      After altering everything to LCS, the table OpsCenter.rollups60 and one other non-OpsCenter table got stuck with everything hanging around in L0.
      The compaction started and ran until the logs showed this:
      ERROR [CompactionExecutor:111] 2013-11-01 19:14:53,865 CassandraDaemon.java (line 187) Exception in thread Thread[CompactionExecutor:111,1,RMI Runtime]
      java.lang.RuntimeException: Last written key DecoratedKey(1326283851463420237, 37382e34362e3132382e3139382d6a7576616c69735f6e6f72785f696e6465785f323031335f31305f30382d63616368655f646f63756d656e74736c6f6f6b75702d676574426c6f6f6d46696c746572537061636555736564) >= current key DecoratedKey(954210699457429663, 37382e34362e3132382e3139382d6a7576616c69735f6e6f72785f696e6465785f323031335f31305f30382d63616368655f646f63756d656e74736c6f6f6b75702d676574546f74616c4469736b5370616365557365640b0f) writing into /var/lib/cassandra/data/OpsCenter/rollups60/OpsCenter-rollups60-tmp-jb-58656-Data.db
      at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:141)
      at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:164)
      at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
      at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
      at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
      at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
      at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
      at org.apache.cassandra.db.compaction.CompactionManager$6.runMayThrow(CompactionManager.java:296)
      at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
      at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
      at java.util.concurrent.FutureTask.run(FutureTask.java:262)
      at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
      at java.lang.Thread.run(Thread.java:724)

      Moving back to STCS kept the compactions running.
      It is especially my own table that I would like to move to LCS.
      After a major compaction with STCS, the move to LCS fails with the same exception.
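
      The second field of each DecoratedKey in these errors is the hex-encoded partition key, so decoding it shows which rows collided. A minimal shell sketch, assuming xxd is available:

      # Decode the hex payload of a DecoratedKey from the error above;
      # "xxd -r -p" turns a plain hex string back into raw bytes.
      echo '37382e34362e3132382e313938' | xxd -r -p; echo
      # prints: 78.46.128.198   (the key prefix in this report)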

      1. compaction_test.py
        3 kB
        Russ Hatch
      2. CASSANDRA-6285-disruptor-heap.patch
        1 kB
        Pavel Yaskevich
      3. 6285_testnotes1.txt
        115 kB
        Michael Shuler
      4. disruptor-high-cpu.patch
        2 kB
        Viktor Kuzmin
      5. disruptor-memory-corruption.patch
        0.6 kB
        Viktor Kuzmin
      6. cassandra-attack-src.zip
        8.56 MB
        Viktor Kuzmin
      7. enable_reallocate_buffers.txt
        1 kB
        Brandon Williams

        Activity

        David Sauer added a comment -

        After removing all the data from the OpsCenter keyspace (and using LCS) and collecting new data for a night, the command nodetool compact OpsCenter rollups60 failed with this exception:

        Error occurred during compaction
        java.util.concurrent.ExecutionException: java.lang.RuntimeException: Last written key DecoratedKey(-6663228376520744598, 37382e34362e3132382e3139382d6a7576616c69735f6e6f72785f6c6f6767696e672d706572666f726d616e63655f67726f757065642d67657452656164436f756e740b0f0000000100000009726f6c6c75707336) >= current key DecoratedKey(-6896470603826733036, 37382e34362e3132382e3139382d6d65646970726569735f7365617263685f696e6465785f323031335f31305f30382d6d756c7469776f7264735f70686f6e656d732d6765744c69766553535461626c65436f756e74) writing into /var/lib/cassandra/data/OpsCenter/rollups60/OpsCenter-rollups60-tmp-jb-14-Data.db
        at java.util.concurrent.FutureTask.report(FutureTask.java:122)
        at java.util.concurrent.FutureTask.get(FutureTask.java:188)
        at org.apache.cassandra.db.compaction.CompactionManager.performMaximal(CompactionManager.java:281)
        at org.apache.cassandra.db.ColumnFamilyStore.forceMajorCompaction(ColumnFamilyStore.java:1845)
        at org.apache.cassandra.service.StorageService.forceKeyspaceCompaction(StorageService.java:2167)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at sun.reflect.misc.Trampoline.invoke(MethodUtil.java:75)
        at sun.reflect.GeneratedMethodAccessor12.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at sun.reflect.misc.MethodUtil.invoke(MethodUtil.java:279)
        at com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBeanIntrospector.java:112)
        at com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBeanIntrospector.java:46)
        at com.sun.jmx.mbeanserver.MBeanIntrospector.invokeM(MBeanIntrospector.java:237)
        at com.sun.jmx.mbeanserver.PerInterface.invoke(PerInterface.java:138)
        at com.sun.jmx.mbeanserver.MBeanSupport.invoke(MBeanSupport.java:252)
        at com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.invoke(DefaultMBeanServerInterceptor.java:819)
        at com.sun.jmx.mbeanserver.JmxMBeanServer.invoke(JmxMBeanServer.java:801)
        at javax.management.remote.rmi.RMIConnectionImpl.doOperation(RMIConnectionImpl.java:1487)
        at javax.management.remote.rmi.RMIConnectionImpl.access$300(RMIConnectionImpl.java:97)
        at javax.management.remote.rmi.RMIConnectionImpl$PrivilegedOperation.run(RMIConnectionImpl.java:1328)
        at javax.management.remote.rmi.RMIConnectionImpl.doPrivilegedOperation(RMIConnectionImpl.java:1420)
        at javax.management.remote.rmi.RMIConnectionImpl.invoke(RMIConnectionImpl.java:848)
        at sun.reflect.GeneratedMethodAccessor35.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at sun.rmi.server.UnicastServerRef.dispatch(UnicastServerRef.java:322)
        at sun.rmi.transport.Transport$1.run(Transport.java:177)
        at sun.rmi.transport.Transport$1.run(Transport.java:174)
        at java.security.AccessController.doPrivileged(Native Method)
        at sun.rmi.transport.Transport.serviceCall(Transport.java:173)
        at sun.rmi.transport.tcp.TCPTransport.handleMessages(TCPTransport.java:556)
        at sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(TCPTransport.java:811)
        at sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(TCPTransport.java:670)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:724)
        Caused by: java.lang.RuntimeException: Last written key DecoratedKey(-6663228376520744598, 37382e34362e3132382e3139382d6a7576616c69735f6e6f72785f6c6f6767696e672d706572666f726d616e63655f67726f757065642d67657452656164436f756e740b0f0000000100000009726f6c6c75707336) >= current key DecoratedKey(-6896470603826733036, 37382e34362e3132382e3139382d6d65646970726569735f7365617263685f696e6465785f323031335f31305f30382d6d756c7469776f7264735f70686f6e656d732d6765744c69766553535461626c65436f756e74) writing into /var/lib/cassandra/data/OpsCenter/rollups60/OpsCenter-rollups60-tmp-jb-14-Data.db
        at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:141)
        at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:164)
        at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
        at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
        at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
        at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
        at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
        at org.apache.cassandra.db.compaction.CompactionManager$6.runMayThrow(CompactionManager.java:296)
        at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        ... 3 more

        Oliver Bock added a comment - - edited

        Just in case it helps: I'm getting almost identical exceptions while running a single-node stress test using cassandra-stress with Cassandra 2.0.2 (DSC) on Debian Wheezy with 2 GB RAM. I'm running 10e7 write ops on a single HDD, using (more or less) Cassandra's default configuration, specifically STCS.
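
        For reference, a run like the one described above would look roughly like this with the 2.0-era stress tool (the exact flags are an assumption, not taken from this report):

        # ~10 million inserts against a local node; -o selects the operation,
        # -n the number of keys, -d the target host.
        cassandra-stress -o insert -n 10000000 -d 127.0.0.1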

        ERROR [CompactionExecutor:14] 2013-11-29 15:33:39,978 CassandraDaemon.java (line 187) Exception in thread Thread[CompactionExecutor:14,1,main]
        java.lang.RuntimeException: Last written key DecoratedKey(-3658992336117051287, 3033353732383438) >= current key DecoratedKey(-4078405136366838408, 3033353634323236) writing into /srv3/cassandra/data/Keyspace1/Standard1/Keyspace1-Standard1-tmp-jb-106-Data.db
        	at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:141)
        	at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:164)
        	at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
        	at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
        	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
        	at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
        	at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
        	at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:197)
        	at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source)
        	at java.util.concurrent.FutureTask.run(Unknown Source)
        	at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
        	at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
        	at java.lang.Thread.run(Unknown Source)
        
        Brandon Kearby added a comment -

        I'm getting it as well on 2.0.4. I'm testing a new cluster. I don't get the error with one node, but when I add two or more I get the same error.

        ERROR [CompactionExecutor:6] 2014-01-15 17:13:13,395 CassandraDaemon.java (line 187) Exception in thread Thread[CompactionExecutor:6,1,main]
        java.lang.RuntimeException: Last written key DecoratedKey(-1983406872803353678, 545749545445523a333535383030333439353835353830303334) >= current key DecoratedKey(-7683510718755081698, 545749545445523a333639333235363931383339333238323537) writing into /BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-tmp-jb-121-Data.db
        	at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:142)
        	at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:165)
        	at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
        	at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
        	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
        	at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
        	at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
        	at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:197)
        	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        	at java.lang.Thread.run(Thread.java:744)
        

        Here's the schema I'm testing with:

        create column family signal
          with column_type = 'Standard'
          and comparator = 'UTF8Type'
          and default_validation_class = 'BytesType'
          and key_validation_class = 'UTF8Type'
          and read_repair_chance = 0.1
          and dclocal_read_repair_chance = 0.0
          and gc_grace = 432000
          and min_compaction_threshold = 4
          and max_compaction_threshold = 32
          and replicate_on_write = true
          and compaction_strategy = 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'
          and caching = 'ALL'
          and compaction_strategy_options = {'sstable_size_in_mb' : '160'}
          and comment = 'A store of information about each individual signal.'
          and column_metadata = [
            {column_name : 'type',
            validation_class : UTF8Type},
            {column_name : 'foo_id',
            validation_class : LongType},
            validation_class : UTF8Type}]
          and compression_options = {'sstable_compression' : 'org.apache.cassandra.io.compress.LZ4Compressor'};
        
        Jonathan Ellis added a comment -

        Can you enable snapshot_before_compaction and post the sstables that it's trying to compact? I can get you a private upload place if necessary.

        Brandon Kearby added a comment -

        Sure,

        BTW, I tried changing to SizeTieredCompactionStrategy and got the same error. I'll enable snapshot_before_compaction.
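
        For anyone following along, snapshot_before_compaction is a cassandra.yaml setting (the path below assumes a typical package install):

        # Check the current value; set it to true and restart the node to
        # snapshot every sstable set before it gets compacted.
        grep snapshot_before_compaction /etc/cassandra/cassandra.yaml
        # snapshot_before_compaction: false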

        Brandon Kearby added a comment - - edited

        Snapshot of compaction before failing
        Added attachment: system-compactions_in_progress-jb-25-Data.db

        Logs before failing

        INFO [CompactionExecutor:6] 2014-01-15 18:38:47,690 ColumnFamilyStore.java (line 740) Enqueuing flush of Memtable-compactions_in_progress@856586691(847/8470 serialized/live bytes, 35 ops)
         INFO [FlushWriter:3] 2014-01-15 18:38:47,691 Memtable.java (line 333) Writing Memtable-compactions_in_progress@856586691(847/8470 serialized/live bytes, 35 ops)
         INFO [FlushWriter:3] 2014-01-15 18:38:47,700 Memtable.java (line 373) Completed flushing /BigData/lib/cassandra/data/system/compactions_in_progress/system-compactions_in_progress-jb-24-Data.db (304 bytes) for commitlog position ReplayPosition(segmentId=1389810508756, position=33429429)
         INFO [CompactionExecutor:6] 2014-01-15 18:38:47,703 CompactionTask.java (line 115) Compacting [SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-45-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-46-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-67-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-72-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-78-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-75-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-56-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-62-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-66-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-49-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-47-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-57-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-61-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-79-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-65-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-50-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-58-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-77-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-54-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-53-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-48-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-64-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-68-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-76-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-55-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-74-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-60-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-52-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-69-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-71-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-73-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-63-Data.db')]
         INFO [CompactionExecutor:6] 2014-01-15 18:39:05,208 ColumnFamilyStore.java (line 740) Enqueuing flush of Memtable-compactions_in_progress@2046507929(0/0 serialized/live bytes, 1 ops)
         INFO [FlushWriter:3] 2014-01-15 18:39:05,208 Memtable.java (line 333) Writing Memtable-compactions_in_progress@2046507929(0/0 serialized/live bytes, 1 ops)
         INFO [FlushWriter:3] 2014-01-15 18:39:05,218 Memtable.java (line 373) Completed flushing /BigData/lib/cassandra/data/system/compactions_in_progress/system-compactions_in_progress-jb-25-Data.db (42 bytes) for commitlog position ReplayPosition(segmentId=1389810508756, position=33430117)
        ERROR [CompactionExecutor:6] 2014-01-15 18:39:05,220 CassandraDaemon.java (line 187) Exception in thread Thread[CompactionExecutor:6,1,main]
        java.lang.RuntimeException: Last written key DecoratedKey(-5705444534806265577, 0000000000000000000000000000000000000000000000000000) >= current key DecoratedKey(-7490754936938484492, 00ab1b0000000000000000000000000000000000000000000000) writing into /BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-tmp-jb-80-Data.db
                at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:142)
                at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:165)
                at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
                at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
                at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
                at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
                at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
                at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:197)
                at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
                at java.util.concurrent.FutureTask.run(FutureTask.java:262)
                at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
                at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
                at java.lang.Thread.run(Thread.java:744)
         INFO [MemoryMeter:1] 2014-01-15 18:39:08,865 Memtable.java (line 451) CFS(Keyspace='system', ColumnFamily='sstable_activity') liveRatio is 14.596825396825396 (just-counted was 14.596825396825396).  calculation took 1ms for 84 cells
         INFO [MemoryMeter:1] 2014-01-15 18:43:44,575 Memtable.java (line 451) CFS(Keyspace='system', ColumnFamily='sstable_activity') liveRatio is 14.591111111111111 (just-counted was 14.585396825396826).  calculation took 4ms for 210 cells
        
        Jonathan Ellis added a comment -

        Can you tar up all the components, not just the .db files?

        Jonathan Ellis added a comment -

        ... for all the sstables in the "Compacting" list

        Brandon Kearby added a comment -

        Hi Jonathan,

        Here's a link to what you need: http://bhorne.test.s3.amazonaws.com/cassandra.tar.gz

        Brandon Kearby added a comment -

        So it seems like it might be related to the hsha server. BTW, I was getting https://issues.apache.org/jira/browse/CASSANDRA-6373 where it would hang describing the ring. So I upgraded to thrift-server-0.3.3.jar. When running with the sync server, I don't get the error above.

        A little more context, I'm using pig and the CassandraStorage class to drive the writes. Running as a map task with 12 concurrent mappers creates 2773 connections!

        lsof -i tcp:9160 | wc -l
        2773

        Jonathan Ellis added a comment -

        The .tar.gz does not contain the sstables mentioned in the error message.

        Brandon Kearby added a comment -

        Correct. The tar contains a full log file with another example of the error.

        Jonathan Ellis added a comment -

        Ryan McGuire Can your team reproduce w/ the schema above and the sstables from the tarball?

        Compacting [SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-2-Data.db'), SSTableReader(path='/BigData/lib/cassandra/data/SocialData/signal/SocialData-signal-jb-1-Data.db')]
        
        Brandon Kearby added a comment - - edited

        BTW, It happens when we use hsha. The schema above is abbreviated as I've left out a lot of the payload from the signal table. Ping me if you need the full signal table definition.

        Brandon Kearby added a comment -

        When I cranked up the number of rpc_max_threads and changed to sync, it stopped happening.
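
        Both knobs mentioned here live in cassandra.yaml; a sketch of what the changed settings might look like (values are illustrative, not a recommendation):

        # Switch the Thrift server implementation and raise the thread cap.
        grep -E 'rpc_server_type|rpc_max_threads' /etc/cassandra/cassandra.yaml
        # rpc_server_type: sync
        # rpc_max_threads: 2048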

        Brandon Kearby added a comment -

        After doing some more digging, looks like my issue is the same as https://issues.apache.org/jira/browse/CASSANDRA-4687

        Jonathan Ellis added a comment -

        Russ Hatch – it would still be useful to try to repro w/ Brandon's instructions, since we don't have a way to repro 4687 yet.

        Russ Hatch added a comment -

        Jonathan Ellis – I was able to get the exception to occur by doing the following:

        create a new cluster with ccm, and populate with 3 nodes (a ccm sketch is included at the end of this comment)

        create keyspace SocialData with placement_strategy='org.apache.cassandra.locator.SimpleStrategy' and strategy_options = {replication_factor:3};
        

        create the signal column family (I had to modify the schema above a little bit to make it work):

        create column family signal
          with column_type = 'Standard'
          and comparator = 'UTF8Type'
          and default_validation_class = 'BytesType'
          and key_validation_class = 'UTF8Type'
          and read_repair_chance = 0.1
          and dclocal_read_repair_chance = 0.0
          and gc_grace = 432000
          and min_compaction_threshold = 4
          and max_compaction_threshold = 32
          and replicate_on_write = true
          and compaction_strategy = 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'
          and caching = 'ALL'
          and compaction_strategy_options = {'sstable_size_in_mb' : '160'}
          and comment = 'A store of information about each individual signal.'
          and column_metadata = [
            {column_name : 'type', validation_class : UTF8Type},
            {column_name : 'foo_id', validation_class : LongType}]
          and compression_options = {'sstable_compression' : 'org.apache.cassandra.io.compress.LZ4Compressor'}; 
        

        stopped the nodes
        copied all the files from the provided tar's /data/SocialData/ directory to one of my nodes
        started the nodes up again
        At this point I didn't find any data in the signal column family (using 'list signal;')
        The exception appeared in the node's log

        ERROR [CompactionExecutor:10] 2014-01-27 12:45:26,734 CassandraDaemon.java (line 187) Exception in thread Thread[CompactionExecutor:10,1,main]
        java.lang.RuntimeException: Last written key DecoratedKey(4322717900587903123, 706f737431353834373031323038270903ae0022076d9f) >= current key DecoratedKey(-7009815163526224622, 545749545445523a333533343836323333393032363439333437) writing into /home/rhatch/.ccm/test_cluster_1390845354/node1/data/SocialData/signal/SocialData-signal-tmp-jb-7-Data.db
        	at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:141)
        	at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:164)
        	at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
        	at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
        	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
        	at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
        	at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
        	at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:197)
        	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        	at java.lang.Thread.run(Thread.java:744)
        

        I was curious if repair would have any bearing, so I ran repair on one node (after which I can see data in the signal table), then I stopped and started the nodes again – a similar exception appears in the log for all 3 nodes ('Last written key DecoratedKey ...').

        I'm not 100% certain if my procedure for using the provided tar's test data was correct, so let me know if there's anything obvious I missed and I'll run through it again.
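
        For reference, the ccm setup in the first step might look like this (the version matches the tag reproduced against above; exact flags are an assumption):

        # Create a three-node local cluster on the 2.0.2 tag and start it.
        ccm create test_6285 -v 2.0.2
        ccm populate -n 3
        ccm start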

        Jonathan Ellis added a comment -

        Was that 2.0 HEAD or 2.0.4?

        Russ Hatch added a comment -

        Oh sorry, I forgot that detail. I reproduced from the cassandra-2.0.2 tag.

        Jonathan Ellis added a comment -

        Can you try 2.0 HEAD as well just to be sure?

        Russ Hatch added a comment -

        OK, it appears we have the same issue on 2.0 HEAD as well (8bbb6e...) – the exception appears on startup using the procedure I included earlier.

        Russ Hatch added a comment -

        I'm going to attempt to condense this down to a simple dtest as well.

        Brandon Kearby added a comment -

        Hi Russ Hatch,

        Here's the full schema I'm using to test with:

        create keyspace SocialData
          with placement_strategy = 'NetworkTopologyStrategy'
          and strategy_options = {DC-Analytics : 3}
          and durable_writes = true;

        use SocialData;

        create column family signal
          with column_type = 'Standard'
          and comparator = 'UTF8Type'
          and default_validation_class = 'BytesType'
          and key_validation_class = 'UTF8Type'
          and read_repair_chance = 0.1
          and dclocal_read_repair_chance = 0.0
          and gc_grace = 432000
          and min_compaction_threshold = 4
          and max_compaction_threshold = 32
          and replicate_on_write = true
          and compaction_strategy = 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'
          and caching = 'NONE'
          and compaction_strategy_options = {'sstable_size_in_mb' : '160'}
          and comment = 'A store of information about each individual signal.'
          and column_metadata = [
            {column_name : 'type', validation_class : UTF8Type},
            {column_name : 'department_id', validation_class : LongType},
            {column_name : 'ecosystem_account_id', validation_class : UTF8Type},
            {column_name : 'content_type', validation_class : UTF8Type},
            {column_name : 'rating_count', validation_class : LongType},
            {column_name : 'service_account_id', validation_class : UTF8Type},
            {column_name : 'time', validation_class : LongType},
            {column_name : 'organization_id', validation_class : LongType},
            {column_name : 'conversation_id', validation_class : UTF8Type},
            {column_name : 'favorites_count', validation_class : LongType},
            {column_name : 'dislike_count', validation_class : LongType},
            {column_name : 'url', validation_class : UTF8Type},
            {column_name : 'impressions', validation_class : LongType},
            {column_name : 'network_strength', validation_class : LongType},
            {column_name : 'parent_signal_id', validation_class : UTF8Type},
            {column_name : 'account_snapshot_id', validation_class : UTF8Type},
            {column_name : 'region_id', validation_class : LongType},
            {column_name : 'time_bucket', validation_class : LongType},
            {column_name : 'enriched_on', validation_class : LongType},
            {column_name : 'dachis_account_id', validation_class : UTF8Type},
            {column_name : 'text', validation_class : UTF8Type},
            {column_name : 'sentiment', validation_class : LongType},
            {column_name : 'like_count', validation_class : LongType},
            {column_name : 'industry_id', validation_class : LongType},
            {column_name : 'service', validation_class : UTF8Type},
            {column_name : 'cloned_from', validation_class : UTF8Type},
            {column_name : 'constituent_type', validation_class : UTF8Type},
            {column_name : 'listings_count', validation_class : LongType},
            {column_name : 'network_size', validation_class : LongType},
            {column_name : 'analyzed', validation_class : Int32Type},
            {column_name : 'username', validation_class : UTF8Type},
            {column_name : 'service_signal_id', validation_class : UTF8Type},
            {column_name : 'language', validation_class : UTF8Type},
            {column_name : 'brand_id', validation_class : LongType},
            {column_name : 'rating', validation_class : LongType},
            {column_name : 'relationship_id', validation_class : UTF8Type}]
          and compression_options = {'sstable_compression' : 'org.apache.cassandra.io.compress.LZ4Compressor'};

        Russ Hatch added a comment -

        I was able to repro with the data provided in the tar (as noted above). Unfortunately my attempts to use dtest to reproduce the issue "from scratch" haven't been successful. First I tried testing with dtest w/CQL, but had no luck, though I may have missed something when trying to translate everything into CQL.

        Next I tried dtest w/thrift but similarly wasn't able to trigger the issue in this way. I tried using the hsha rpc_server and was getting what appeared to be an unrelated error:

        ERROR [Thrift-Selector_0] 2014-01-28 11:09:16,762 Message.java (line 153) Read an invalid frame size of 0. Are you using TFramedTransport on the client side?
        

I'll attach my basic dtest here in case it's useful later, but for now I can't repro the issue without the provided db.

        Tyler Hobbs added a comment -

        Brandon Kearby It looks like there may be a few things going on here.

The first is that some of your column names are not valid UTF-8. I'm not terribly familiar with the UTF-8 specs, but the column names seem to fail validation in different ways, and Python agrees that they are not valid UTF-8, so I don't think it's a problem with our validation code. Did you change the comparator from BytesType to UTF8Type at some point? It might not be relevant to this ticket, but you may want to check on that on your end.

        The second problem is that SocialData-signal-jb-2-Data.db has some out-of-order rows. It looks like about 9 rows are randomly out of place in the sstable. Running scrub would fix this, but I think it's erroring on UTF8 validation. If I change the comparator to BytesType, the scrub completes and the rows are written in order. So the problem is not necessarily with compaction itself but with out-of-order rows being written to sstables.

        Given that switching from hsha to sync seemed to fix the problem, I wonder if that's part of the original cause.
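
For reference, the strict UTF-8 check described here can be reproduced with a few lines of Java (an illustrative sketch only, not Cassandra's actual validation code):

import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

public class Utf8Check {
    // Returns true only if the bytes decode as well-formed UTF-8.
    static boolean isValidUtf8(byte[] bytes) {
        try {
            StandardCharsets.UTF_8.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        byte[] ok = "rating_count".getBytes(StandardCharsets.UTF_8);
        byte[] bad = new byte[] { (byte) 0xC3, (byte) 0x28 }; // truncated two-byte sequence
        System.out.println(isValidUtf8(ok));   // true
        System.out.println(isValidUtf8(bad));  // false
    }
}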

        Brandon Kearby added a comment -

Tyler Hobbs, the odd thing is that with hsha it works with one node. When we have two or more nodes in the cluster, we start getting these errors.

        Ravi Prasad added a comment -

        cc Pavel Yaskevich
We were also seeing such random out-of-place partitions/rows in sstables (rows not hashing to the node) while using the disruptor-based hsha thrift server, causing compaction to fail with out-of-order keys. This used to happen on freshly flushed sstables in L0. We also used to see thrift validation failing on some columns while reading back. We don't see these after switching back to the sync server.

Pavel Yaskevich added a comment - edited

Ravi Prasad Can you try with the most recent release of hsha, version 0.3.3? Just remove the old jar and drop in the new one; that should be sufficient.

        Brandon Kearby added a comment -

        Pavel Yaskevich, I was running with 0.3.3. The previous version would hang on describe ring for me.

        Pavel Yaskevich added a comment -

Russ Hatch The 0 frame size you are seeing is a known Thrift problem which happens even with stock server implementations; they are working on it, but it shouldn't cause any problems, as such frames are ignored (it can also happen if something does e.g. telnet to the thrift port). I'm not sure this is a problem with HsHa directly; it might be unveiled by the increased throughput you can get with HsHa compared to sync. It looks exactly like https://issues.apache.org/jira/browse/CASSANDRA-4687 (as Brandon Kearby mentioned), so can you try disabling key_cache and uploading again with hsha?

        Brandon Kearby added a comment -

        Pavel Yaskevich, I've tried disabling the key_cache and it didn't help. That was the last thing I tried.

        Pavel Yaskevich added a comment -

I see that you have it set to NONE in the CF schema, but have you also tried disabling it altogether in the yaml? I'm not saying that it would help; I'm just trying to eliminate all possibilities. It's just not obvious to me, if this is a hsha problem, how Thrift could correctly interpret erroneous data from the socket, dispatch it to the right Thrift handler, and deserialize the whole mutation (and meta information) to insert it into storage...

        Ravi Prasad added a comment -

Also, one more factor with the disruptor-based hsha is direct memory/Unsafe versus heap-based message buffers. When we encountered this issue, we were running with JNA, hence using direct memory buffers. I didn't test with heap-based message buffers.

        Nikolai Grigoriev added a comment -

        I have started seeing these too. Surprisingly...after adding OpsCenter CE to my cluster. I do not see these associated with my own data.

java.lang.RuntimeException: Last written key DecoratedKey(3542937286762954312, 31302e332e34352e3135382d676574466c757368657350656e64696e67) >= current key DecoratedKey(-2152912038130700738, 31302e332e34352e3135362d77696e7465726d7574655f6a6d657465722d776d5f6170706c69636174696f6e732d676574526563656e74426c6f6f6d46) writing into /hadoop/disk1/cassandra/data/OpsCenter/rollups300/OpsCenter-rollups300-tmp-jb-5055-Data.db
                at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:142)
                at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:165)
                at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
                at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
                at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
                at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
                at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
                at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:197)
                at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
                at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
                at java.util.concurrent.FutureTask.run(FutureTask.java:166)
                at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
                at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
                at java.lang.Thread.run(Thread.java:724)
        
        sankalp kohli added a comment -

        This issue is also in logs attached in CASSANDRA-6716.

        Nikolai Grigoriev added a comment -

Can confirm on my side. I have switched to the "sync" RPC server and, after a few scrubs/restarts, I am running my load tests on a 6-node 2.0.5 cluster without a single exception in the last ~8 hours.

I tried to correlate the moment I started getting a large number of FileNotFoundExceptions with other events in my cluster... and realized that it was not exactly the 2.0.5 upgrade. It seems to correlate mostly with the moment when my jmeter server ran out of free space and a bunch of tests crashed. Obviously, these crashes terminated a few hundred client connections to Cassandra.

Not sure if it is related, but it seems that from that moment there was some sort of snowball effect.

        Marcus Eriksson added a comment -

I think we can conclude that this only happens with HSHA.

        I brought back TThreadedSelectorServer instead of the Disruptor based server and have been running it for a few hours without the bug happening.

        Could someone (Viktor Kuzmin ?) try out https://github.com/krummas/cassandra/commits/marcuse/hsha and see if you can break it?

Note that I had to make a change in thrift 0.9.1 to get it to build and work; I'll follow up on that if this seems to solve the issue.

        Viktor Kuzmin added a comment -

        https://www.dropbox.com/s/3ldg10zh7qvva27/cassandra-attack.jar

The schema is located inside the jar file (cassandra.txt).

        1. Start cassandra
        2. java -jar cassandra-attack.jar
        3. Stop cassandra
        4. Start cassandra - commit logs will be corrupted.

        Nikolai Grigoriev added a comment -

        Marcus Eriksson

        I think using HSHA makes it easier to reproduce but...I am running SYNC for over a week now and recently I have experienced the same issue again.

We had another unclean shutdown (hrrr...some people are smarter than the UPSes) and after bringing the nodes back I found that on one node my compactions constantly fail with FileNotFoundException. Even worse, I can't scrub the keyspace/CF in question because "scrub" fails instantly with "RuntimeException: Tried to hard link to file that does not exist...". I have reported that one too. The only way I have found so far to fix that issue is to restart Cassandra on that node, stop compactions as soon as it starts (well, I could disable them differently, I assume) and then scrub. Sometimes I have to do it in several iterations to complete the process. Once I scrub all problematic KS/CFs I see no more exceptions.

        Viktor Kuzmin added a comment -

The FileNotFoundException bug is not related to the HsHa problem.
As for needing several iterations of scrub, see:
https://issues.apache.org/jira/browse/CASSANDRA-6791

        Jonathan Ellis added a comment -

According to http://mail-archives.apache.org/mod_mbox/cassandra-user/201402.mbox/%3C038601cf28ea$a2e504d0$e8af0e70$@struq.com%3E the 0.9 TThreadedSelectorServer works well, although I'm not sure if he means that it performs better than the 0.8 TTSS or just that it doesn't cause corruption.

        Viktor Kuzmin added a comment -

I've tried to investigate the problem with HsHaDisruptorServer, but with no luck; truth be told, I see no reason for that server to corrupt data.
Also, HsHaDisruptor does not corrupt data when useHeapBasedAllocation is turned on.
Moreover, if you look at the disruptor-thrift-server code (Message.reallocateDataBuffer) and turn on heap-based allocation only for dataBuffer, then you will not see corruption.

        Pavel Yaskevich added a comment -

Nikolai Grigoriev and Brandon Kearby can you try setting useHeapBasedAllocation to "true"? I'm fine with switching back to TThreadedSelectorServer if that helps.

        Nikolai Grigoriev added a comment -

        Pavel Yaskevich

That seems to be a parameter of the Thrift server... How do I control it? Or should I just disable JNA?

        Pavel Yaskevich added a comment -

You can do it via JMX or by disabling JNA; I can also make a patch which would set it explicitly in Cassandra code.

        Christopher Wirt added a comment -

[~Jonathan Ellis] [~Marcus Eriksson] That was my post from the user mailing list.
        After our 1.2.14 -> 2.0.5 upgrade and failure to get the new HsHa stable in our system, we moved to using the thrift 0.9.1 TTSS with reasonable success. We've now been running for two weeks under a relatively high read load.
        We haven't seen any "DecoratedKey != ..." errors.
        We have seen some warnings on start up about SSTable rows being out of order.
        We have seen commit logs starting to build up with "All time blocked" on the FlushWriter incrementing.

Performance comparisons might be a little unfair, but certainly our p95 and p99 have overall improved.

Obviously we're very keen not to be running a custom build of C*.

        Marcus Eriksson added a comment -

        Christopher Wirt could you paste those startup log lines?

        Christopher Wirt added a comment -

These have since disappeared. I restarted this node just now to check. We haven't run a scrub.

        WARN [main] 2014-02-16 23:23:02,032 LeveledManifest.java (line 171) At level 1, SSTableReader(path='/disk2/cassandra/data/struqrealtime/impressionstorev2/struqrealtime-impressionstorev2-jb-118905-Data.db') [DecoratedKey(-1513272878957942943, c41c955b40274acfa466ccb6079a21e5), DecoratedKey(6301362410765453237, 43ae61aacbc446be92c8bdea1d43e342)] overlaps SSTableReader(path='/disk2/cassandra/data/struqrealtime/impressionstorev2/struqrealtime-impressionstorev2-jb-116400-Data.db') [DecoratedKey(3953001739649874864, 5811ce41b7014917ab82eb32e8861ca5), DecoratedKey(9190609424240623933, 4e5b00a5a7594289924674974f44a995)]. This could be caused by a bug in Cassandra 1.1.0 .. 1.1.3 or due to the fact that you have dropped sstables from another node into the data directory. Sending back to L0. If you didn't drop in sstables, and have not yet run scrub, you should do so since you may also have rows out-of-order within an sstable

        Marcus Eriksson added a comment -

        Christopher Wirt those entries are unrelated (and fixed in CASSANDRA-6688)

        Pavel Yaskevich added a comment -

The patch sets heap-based allocation by default in the disruptor server, which should make it easier for people to test that scenario...

        Jonathan Ellis added a comment -

        Michael Shuler Can you test hsha with Viktor's jar above? (https://issues.apache.org/jira/browse/CASSANDRA-6285?focusedCommentId=13917950&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13917950)

        I want to know if

        1. you can reproduce with a single node
        2. if not, if you can reproduce with multiple nodes
3. assuming either 1 or 2, if you can still reproduce after applying Pavel's heap allocation patch
        Michael Shuler added a comment -

        Sure - let me see what I can find out.

        Michael Shuler added a comment -

        6285_testnotes1.txt attached.

Neither a single node with hsha nor a 3-node ccm cluster with hsha gave me any interesting errors with the attack jar. Should I go back and try some of the previous repro steps and check yay/nay on whether the patch fixes those?

Pavel Yaskevich added a comment - edited

        Michael Shuler Can you try the same on the machine running Linux (if you haven't done that yet)?

        Edit: from the log it looks like Disruptor wasn't using the off-heap memory because JNA is disabled, "Off-heap allocation couldn't be used as JNA is not present in classpath or broken, using on-heap instead." So it would be great if you could test this on Linux with jna enabled.

        Thanks!

Michael Shuler added a comment - edited

        I'm using a linux machine - and will link in JNA - good suggestion.

Michael Shuler added a comment - edited

        With jna enabled, yes, on a single node, after running the attack jar and restarting c*, I get:

         INFO [main] 2014-03-06 14:46:51,272 ColumnFamilyStore.java (line 254) Initializing tmp.CF
         INFO [main] 2014-03-06 14:46:51,277 ColumnFamilyStore.java (line 254) Initializing system_traces.sessions
         INFO [main] 2014-03-06 14:46:51,280 ColumnFamilyStore.java (line 254) Initializing system_traces.events
         INFO [main] 2014-03-06 14:46:51,281 CassandraDaemon.java (line 291) completed pre-loading (5 keys) key cache.
 INFO [main] 2014-03-06 14:46:51,288 CommitLog.java (line 130) Replaying /var/lib/cassandra/commitlog/CommitLog-3-1394138577628.log, /var/lib/cassandra/commitlog/CommitLog-3-1394138577629.log
 INFO [main] 2014-03-06 14:46:51,311 CommitLogReplayer.java (line 184) Replaying /var/lib/cassandra/commitlog/CommitLog-3-1394138577628.log (CL version 3, messaging version 7)
ERROR [main] 2014-03-06 14:46:51,432 CommitLogReplayer.java (line 306) Unexpected error deserializing mutation; saved to /tmp/mutation7738708428696995512dat and ignored.  This may be caused by replaying a mutation against a table with the same name but incompatible schema.  Exception follows:
        org.apache.cassandra.serializers.MarshalException: Invalid version for TimeUUID type.
                at org.apache.cassandra.serializers.TimeUUIDSerializer.validate(TimeUUIDSerializer.java:39)
                at org.apache.cassandra.db.marshal.AbstractType.validate(AbstractType.java:172)
                at org.apache.cassandra.db.commitlog.CommitLogReplayer.recover(CommitLogReplayer.java:276)
                at org.apache.cassandra.db.commitlog.CommitLogReplayer.recover(CommitLogReplayer.java:97)
                at org.apache.cassandra.db.commitlog.CommitLog.recover(CommitLog.java:151)
                at org.apache.cassandra.db.commitlog.CommitLog.recover(CommitLog.java:131)
                at org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:312)
                at org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:471)
                at org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:560)
        

I'll double-check a 3-node cluster, then patch and see where I get.

(edit) this looks quite different from the previously posted errors - not sure if I'm on the right track here...

        Michael Shuler added a comment -

Both a single local hsha node and a 3-node ccm cluster with hsha (JNA enabled on both) throw the above errors after an attack jar run and restart. The patch does appear to fix both the single node and the ccm cluster. My pre-patch ccm cluster never fully restarted, but do we need logs or anything from before/after?

        Pavel Yaskevich added a comment -

I don't think we need logs. Jonathan Ellis I'm thinking of committing the attached patch, which should help while I work on resolving the off-heap problem. WDYT?

Benedict added a comment - edited

So, I think there may potentially be at least two races in the off-heap deallocation. I suspect that may not be everything, though, as these two races probably won't cause the problem often. These are predicated on the assumption that thrift doesn't copy data from the DirectByteBuffer that the hsha server provides to it, so I could be wrong, but anyway:

1) CL appends can lag behind the memtable update and, as a result, behind the acknowledgment of write success to the client. If the CL record contains the ByteBuffer when it is freed, and that address is then reused in another allocation, it will write incorrect data to the commit log.
2) I believe thrift calls are two-stage. If that is the case, and the client disconnects between sending the first stage and receiving the result in the second stage, the buffer could be freed whilst still in flight to the memtable/CL.

These are just quick ideas for where it might be; I haven't familiarised myself fully with thrift, the disruptor etc. to be certain these are plausible, but it may turn out to be useful so I thought I'd share.
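
A minimal, self-contained simulation of race (1), assuming, as above, that the commit log keeps a view of the request buffer rather than a copy; the class, timing, and names here are hypothetical, not Cassandra code:

import java.nio.ByteBuffer;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class BufferReuseRace {
    public static void main(String[] args) throws Exception {
        ExecutorService logWriter = Executors.newSingleThreadExecutor();
        ByteBuffer shared = ByteBuffer.allocateDirect(16); // stands in for the RPC frame buffer

        shared.put("key-AAAA".getBytes()).flip();
        ByteBuffer slice = shared.duplicate(); // the "commit log" keeps a view, not a copy

        // The CL append is queued but lags behind the memtable update / client ack...
        Future<?> appended = logWriter.submit(() -> {
            try { Thread.sleep(50); } catch (InterruptedException ignored) {}
            byte[] out = new byte[8];
            slice.get(out);
            System.out.println("commit log wrote: " + new String(out)); // prints key-BBBB
        });

        // ...meanwhile the client was already acked and the buffer is reused.
        shared.clear();
        shared.put("key-BBBB".getBytes()).flip();

        appended.get();
        logWriter.shutdown();
    }
}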

        Rick Branson added a comment -

Unfortunately this hit us during our 2.0.5 upgrade, and 'sync' is not an option for the # of connections we have per node (we tried this). We've been running Marcus' patch in prod and limping along on it, but it looks like the requestInvoke() override is causing the requests to get executed on the selector pool (which is limited by the # of CPUs) instead of the executor service, so our response times are pretty bad. The lack of anything showing up in the JMX for the executor service definitely points towards this.

        Pavel Yaskevich added a comment -

Rick Branson Which of Marcus' patches are you talking about? Also I want to clarify one thing: the disruptor server doesn't use requestInvoke(FrameBuffer) but dispatchInvoke(Message), which schedules the message to an executor pool backed by a ring buffer (WorkerPool), so the actual execution is done in a separate thread. I can attach a patch which would switch back to TThreadedSelectorServer, which ships with Thrift (the only difference between it and the disruptor is that it schedules to a classic thread pool); maybe the disruptor server wasn't as good an idea for all of the real-world use cases...

        Benedict added a comment -

It looks like thrift doesn't retain DirectByteBuffers, it just reads straight from them. The only possible window for corruption is during the construction of the thrift args object, which is a fairly narrow window.

        Rick Branson added a comment -

We put in the TThreadedSelectorServer patch from Marcus. On top of that, to get our response times down from 10x what they should be, I rolled out a larger hard-coded selector thread pool size of 256 (instead of the # of processors, a measly 16). This is shaping up nicely.

        Pavel Yaskevich added a comment -

Rick Branson So what you are saying is that after the problem with the disruptor you never tried it again with on-heap buffers, but switched to TThreadedSelectorServer and increased the selector pool size, and that the requestInvoke() problem is with TThreadedSelectorServer?

        Rick Branson added a comment -

Pavel Yaskevich: We had perf issues with the disruptor as well (sudden spikes of CPU to 100%) on top of this, so I just wanted to get production away from it ASAP.

        Pavel Yaskevich added a comment -

Rick Branson But the most important question, for this ticket at least, is: did you run with on-heap or off-heap buffers? I can bring TThreadedSelectorServer back in this ticket or just go with on-heap buffers and the disruptor. Michael Shuler do you have any performance tests related to the Thrift server? Maybe there is low-hanging fruit in there to fix the spikes Rick mentioned, if we can reproduce them.

        Jonathan Ellis added a comment -

I'm thinking of committing the attached patch, which should help while I work on resolving the off-heap problem

        Yeah, let's do this for now and roll 2.0.6 so we can stop the bleeding, then figure out whether doing more work on disruptor or TTSS is better for 2.0.7.

        Rick Branson added a comment -

        Pavel Yaskevich: off-heap for disruptor.

        I think that we should really consider bringing back the old HSHA implementation from 1.2 as the "hsha" and allow switching to the disruptor implementation as another rpc_server_type for those that want to try it out.

        Brandon Williams added a comment -

        I think that we should really consider bringing back the old HSHA implementation from 1.2 as the "hsha" and allow switching to the disruptor implementation as another rpc_server_type for those that want to try it out.

I think I'm inclined to agree with this, aside from it creating yaml compatibility problems in a minor release. If we just give up and effectively revert, what have we lost? We need 2.0 stabilization sooner rather than later now, in all aspects. If we can't trust the disruptor except with a small change, let's just not trust it yet and worry about it in a future release. With the 2.1 beta already out, we can't tolerate much instability in the 2.0 branch.

        Pavel Yaskevich added a comment -

I'm not sure there is a point in going all the way back to the original HsHa when there is TThreadedSelectorServer, but I'm fine with offering the disruptor as a separate option, something like "disruptor_hsha", and making "hsha" the TThreadedSelectorServer from Thrift; that's how I wanted it originally. Also, I just want to mention that people have reported that the disruptor works for them with on-heap buffers, so I'm not sure we need to go all paranoid about this...

        Marcus Eriksson added a comment -

        My branch from above needed a tiny hack to thrift (https://github.com/krummas/thrift/commit/01ba2a3f3d386d0981371aab2494470e2a78e596), so if we want to roll with TTSS we should refactor our thrift usage a bit to avoid that hack

        Pavel Yaskevich added a comment -

Ah, so they have finally made transport a protected field in FrameBuffer... Well, that considerably complicates switching back to TThreadedSelectorServer.

        Sylvain Lebresne added a comment -

Alright, I've committed Pavel's patch above as a stopgap solution, as discussed, because I want to start a vote on 2.0.6 asap (the changelog is getting pretty big). I've created CASSANDRA-6815 to decide what we want the followup to be for 2.0.7.

Miles Shang added a comment - edited

        To add to Rick Branson's input, we're also seeing the same stacktrace as Michael Shuler (TimeUUID MarshalException). I inspected the row mutations that caused it. Three ranges were nonsensical: the key, the column name, and the value. By nonsensical, I mean that they don't match my expectation of what we are inserting in production data. All other ranges seemed fine (timestamps, masks, sizes, cfid). The key, column name, and value were read successfully, so their length metadata was good. For our data, the column comparator is TimeUUID. Our client library is pycassa. Whereas pycassa generates tuuids like this: 913d7fea-a631-11e3-8080-808080808080, the nonsensical column names look like this: 22050aa4-de11-e380-8080-80808080800b and this: 10c326eb-86a4-e211-e380-808080808080. Most are of the first form. By shifting these nonsensical tuuids to the left or right by an octet, you get a reasonable tuuid. I don't have a similar insight into the nonsensical keys and values, but they could also be left or right shifted.
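
For reference, a type-1 (time-based) UUID stores its version in the high nibble of byte 6, which is what a TimeUUID validity check looks at, so shifting the 16 bytes by one octet moves that nibble and the value fails validation. A small illustrative sketch (not code from the ticket):

import java.nio.ByteBuffer;
import java.util.UUID;

public class UuidShift {
    // The UUID version is the high nibble of byte 6 (big-endian layout).
    static int version(byte[] b) { return (b[6] >> 4) & 0x0f; }

    public static void main(String[] args) {
        UUID u = UUID.fromString("913d7fea-a631-11e3-8080-808080808080");
        byte[] bytes = ByteBuffer.allocate(16)
                .putLong(u.getMostSignificantBits())
                .putLong(u.getLeastSignificantBits()).array();
        System.out.println("version = " + version(bytes)); // 1: a valid TimeUUID

        byte[] shifted = new byte[16];            // shift right by one octet
        System.arraycopy(bytes, 0, shifted, 1, 15);
        System.out.println("shifted version = " + version(shifted)); // no longer 1
    }
}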

        Ryan McGuire added a comment -

        I'd like to be able to reproduce this in dtests to track this bug. Seeing as Russ Hatch's python test wasn't able to repro this issue, and a quick test I wrote doesn't either, does anyone have a simple way to reproduce this issue?

        Viktor Kuzmin would you mind sharing the source code for your attack jar?

Viktor Kuzmin added a comment - edited

Attached patches for the on-heap disruptor.

The first patch (disruptor-high-cpu.patch) turns off any key interests while we're waiting for a message to be processed. We need that because processing may be delayed under high load and there may still be something available to read from the stream; in that case we'll have 100% usage of a CPU core.

The second patch (disruptor-memory-corruption.patch) makes a copy of the off-heap ByteBuffer when reading binary data. This binary data may be stored inside cassandra as-is even after message processing, and it can be corrupted because its memory may already have been deallocated.
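
The second patch's approach amounts to a defensive copy; a minimal sketch of that shape (names here are assumed for illustration, the actual change is in disruptor-memory-corruption.patch):

import java.nio.ByteBuffer;

final class BinaryRead {
    // Instead of slicing the off-heap frame buffer, copy the bytes onto the
    // heap so they stay valid after the frame's memory is freed or reused.
    static ByteBuffer readBinaryCopy(ByteBuffer frame, int length) {
        byte[] copy = new byte[length];
        frame.get(copy);                 // consume 'length' bytes from the frame
        return ByteBuffer.wrap(copy);    // heap-backed, independent of the frame
    }
}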

        Viktor Kuzmin added a comment -

Attached cassandra-attack-src.zip - an eclipse project for running a high-load test against cassandra.
The attack uses 100 threads to issue writes, reads and deletes.

Benedict added a comment - edited

Hmm. Just taking a look at Viktor's patch, I realised that my initial conclusions were actually quite plausible and probably (one of) the causes of the problem. When I dismissed them, I didn't realise we were using a custom TBinaryProtocol implementation. In particular, (1) is definitely possible, and probably the cause of the issue, although the attack jar source would be helpful to figure out if there are any other potential causes. We should be able to force the problem to occur by artificially delaying the commit log write, which would prove this.

Either way, I don't think Viktor's patch is the best way to deal with this problem, as it leaves cleaning up the direct buffers to the GC. Since we could be creating a lot of these, we could create an awful lot of artificial memory pressure. Honestly, I think the best solution is to simply avoid using direct buffers with thrift, at least until 2.1, which should fix this problem by ensuring the CL write (if not the commit) has happened before performing the memtable insertion.
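
As a sketch of that ordering (assumed semantics with hypothetical interfaces, not the actual 2.1 write path): the acknowledgment is chained after the commit-log append, so the request buffer cannot be recycled while the CL append still reads from it.

import java.util.concurrent.CompletableFuture;

class WritePath {
    interface CommitLog { CompletableFuture<Void> append(byte[] mutation); }
    interface Memtable  { void apply(byte[] mutation); }

    final CommitLog commitLog;
    final Memtable memtable;

    WritePath(CommitLog cl, Memtable mt) { commitLog = cl; memtable = mt; }

    CompletableFuture<Void> write(byte[] mutation) {
        return commitLog.append(mutation)                  // CL write first...
                .thenRun(() -> memtable.apply(mutation));  // ...then memtable, then ack
    }
}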

        Viktor Kuzmin added a comment -

My patch is definitely NOT GOOD. That patch also tells me another thing: it seems that we get a 'success' answer before the data is passed to the commit log... I don't think that's good.

        Benedict added a comment -

we get a 'success' answer before the data is passed to the commit log

        Yes, see my comment from a few days ago:

1) CL appends can lag behind the memtable update and, as a result, behind the acknowledgment of write success to the client. If the CL record contains the ByteBuffer when it is freed, and that address is then reused in another allocation, it will write incorrect data to the commit log.

        This is an absolutely plausible scenario since we do actually slice directly from the DirectByteBuffer, which I previously thought we did not.

        Show
        Benedict added a comment - we have 'success' answer before data is passed to commitlog Yes, see my comment from a few days ago: 1) CL appends can be lagged behind the memtable update and, as a result, the acknowledgment to the client of success writing. If the CL record contains the ByteBuffer when it is freed, and that address is then reused in another allocation, it will write incorrect data to the commit log. This is an absolutely plausible scenario since we do actually slice directly from the DirectByteBuffer, which I previously thought we did not.
        Benedict added a comment -

        Has anybody tested this problem against 2.1? If this is the only issue, it should be fixed there.

        Ryan McGuire added a comment -

        Benedict I haven't yet been able to reproduce this with anything other than Viktor's attack jar. I'm thinking Java's threading is beating Python's threading here, so I created a dtest that just runs his jar directly. This test is currently passing on cassandra-2.0 and cassandra-2.1 HEAD.

        Jonathan Ellis added a comment -

        You'd want to revert Pavel's patch from 2.1 to test Benedict's theory.

        Viktor Kuzmin added a comment -

        You may set the thread count to one in the cass-attack jar and still be able to reproduce the error.

        Benedict added a comment -

        You'd want to revert Pavel's patch from 2.1

        beta1 should be fine to test against for this

        Viktor Kuzmin added a comment -

        I've tried my test with beta1 and I can confirm that I was not able to reproduce the bug.
        I think it would be better not to use the disruptor on 2.0.x even with on-heap allocation (it can still reuse a buffer when a new message is the same size as the previous one).
        And it should be safe to use the disruptor on the 2.1 branch.

        We'll be waiting for the 2.1 release, because it really impressed me compared to 2.0.

        Viktor Kuzmin added a comment -

        To tell the truth, I don't think that this is really fixed in 2.0.6.
        It's not easy to reproduce the bug right now, but I think it can be. The thrift-disruptor server does not allocate a new buffer for a new message when the new message is the same size as the previous one. In that case the bug can be reproduced even with on-heap allocation.
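
        For clarity, a sketch of the reuse policy just described (illustrative only - FrameBuffers and bufferFor are invented names, not the disruptor_thrift_server source): the frame buffer is replaced only when the incoming size differs, so an equal-sized message lands in memory that earlier consumers may still reference, and an alwaysReallocate flag sidesteps that.

        import java.nio.ByteBuffer;

        // Sketch of the described policy: reuse the frame buffer whenever the
        // new message is exactly the same size as the previous one.
        final class FrameBuffers {
            private ByteBuffer frame;

            ByteBuffer bufferFor(int messageSize, boolean alwaysReallocate) {
                if (alwaysReallocate || frame == null || frame.capacity() != messageSize)
                    frame = ByteBuffer.allocate(messageSize); // fresh, safe buffer
                else
                    frame.clear();                            // reused: same backing memory
                return frame;
            }
        }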

        Benedict added a comment -

        +1. That needs to be fixed as well.

        Robert Coli added a comment -

        ... To tell the truth, I don't think that this is really fixed in 2.0.6.

        If hsha is irrevocably broken with data corruption risk in the 2.0 line, could we either get it wired off in the next point release, or get some messaging in NEWS.txt that instructs people not to use it? My preference is the former, to cover upgraders who are foolish enough not to read NEWS.txt; I am unable to see the benefit of leaving it usable if it is known to be broken.

        Edward Capriolo added a comment -

        I read through this. Does it make sense to call this HSHA2 and restore the old code and call it HSHA?

        Rick Branson added a comment -

        This is not fixed. Still seeing the same exception running 2.0.6.

        ERROR [CompactionExecutor:7] 2014-05-09 17:59:58,640 CassandraDaemon.java (line 196) Exception in thread Thread[CompactionExecutor:7,1,main]
        java.lang.RuntimeException: Last written key DecoratedKey(132126721345628486111245439753727165857, 0f3b67f2) >= current key DecoratedKey(37424530135488872684523334498941679307, 196b70ab) writing into /data/cassandra/data/redacted/Redacted/redacted-Redacted-tmp-jb-156533-Data.db
        at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:142)
        at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:165)
        at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
        at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
        at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
        at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
        at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
        at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:197)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:744)

        Brandon Williams added a comment -

        The line in question: https://github.com/xedin/disruptor_thrift_server/commit/77d6715af0eeba4c52f42fa6ba6549c8ae52ffa7#diff-18c889f19dc9fbeb73af99dcff152b6eR421
        Brandon Williams added a comment -

        Patch to enable buffer reallocation.

        Jonathan Ellis added a comment -

        I thought we were reallocating by default but I must have gotten that confused with on-heap buffers above. If Viktor is right, reusing buffers is always potentially dangerous and should just be removed. Can you comment, Pavel Yaskevich?

        Pavel Yaskevich added a comment -

        No, by default it's turned off, because the Thrift-side expectation is that once the invocation is complete nobody else holds the buffers, but it seems like the problem is that on the Cassandra side we actually never copy the buffer for the commit log (or was it something else?). So we need to set the thrift server to alwaysReallocate explicitly.

        Rick Branson I can give you an updated jar so you don't have to wait for the release of Cassandra which would have alwaysReallocate set to true by default.

        Pavel Yaskevich added a comment -

        So I can do two things: a) set alwaysReuse to true by default and release 0.3.5 today, or b) you can just switch to alwaysReallocate(true) in the configuration for 2.0.8; either works for me.

        Jonathan Ellis added a comment -

        Yeah, we do treat BB as immutable so CL would understandably not expect Thrift to pull the rug out from under it.

        I'm fine with calling alwaysReallocate on the Cassandra side in the interest of not changing things out from under any other users.
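
        To illustrate, the immutability assumption can be honoured with a deep copy at the transport boundary; a minimal standalone sketch in plain Java (Cassandra has a comparable clone helper in ByteBufferUtil; this version only shows the idea, and Buffers/deepCopy are invented names):

        import java.nio.ByteBuffer;

        // Defensive copy at the transport boundary: the commit log keeps its
        // own on-heap bytes, so Thrift recycling the source cannot hurt it.
        final class Buffers {
            static ByteBuffer deepCopy(ByteBuffer src) {
                ByteBuffer copy = ByteBuffer.allocate(src.remaining());
                copy.put(src.duplicate()); // duplicate() leaves src's position untouched
                copy.flip();
                return copy;
            }
        }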

        Brandon Williams added a comment -

        I have no issue with doing a) AND b), just to be extra safe, if we know this puts the nail in this ticket's coffin.

        Edward Capriolo added a comment -

        I was poking around the dependency a bit:

        Running com.thinkaurelius.thrift.OffHeapMultiRequestTest
        Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 2.515 sec
        Running com.thinkaurelius.thrift.OnHeapMultiConnectionWithReallocateTest
        Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.481 sec
        Running com.thinkaurelius.thrift.OffheapMultiConnectionWithRellocateTest
        Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.478 sec
        Running com.thinkaurelius.thrift.OnHeapMultiConnectionTest
        Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.417 sec
        Running com.thinkaurelius.thrift.OnHeapMultiRequestWithReallocateTest
        Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.735 sec
        Running com.thinkaurelius.thrift.OffHeapMultiRequestWithReallocateTest
        Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.437 sec
        Running com.thinkaurelius.thrift.OffHeapMultiConnectionTest
        Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.282 sec
        Running com.thinkaurelius.thrift.OnHeapMultiRequestTest
        Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.491 sec

        Q. Are a few tests that run in roughly 20 seconds enough to prove that this component is fit for production? These highly concurrent off-heap systems can have very subtle bugs, as this ticket shows.

        If I understand correctly, HSHA is not the default, the artifact has good test coverage, and there are only a handful of findbugs issues. Is there any piece that is going to run end-to-end, or attempt to load/concurrently test these classes and be more rigorous than the previous system? Can that be made? (See the sketch below for the kind of assertion I mean.)
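
        As a sketch of what such a test could assert (a toy harness, all names invented, not a real Cassandra test): queue many validations against a single recycled buffer, and count every late read that observes a later producer's bytes as corruption.

        import java.nio.ByteBuffer;
        import java.util.concurrent.*;
        import java.util.concurrent.atomic.AtomicLong;

        // Toy load/correctness harness: one recycled frame shared between a
        // fast producer and a lagging single-threaded consumer.
        public class ReuseStress {
            public static void main(String[] args) throws Exception {
                ByteBuffer recycled = ByteBuffer.allocate(8);
                ExecutorService consumer = Executors.newSingleThreadExecutor();
                AtomicLong corrupted = new AtomicLong();

                for (long i = 0; i < 100_000; i++) {
                    final long expected = i;
                    recycled.putLong(0, expected);           // producer reuses the frame
                    consumer.submit(() -> {
                        if (recycled.getLong(0) != expected) // consumer may read late
                            corrupted.incrementAndGet();
                    });
                }
                consumer.shutdown();
                consumer.awaitTermination(1, TimeUnit.MINUTES);
                System.out.println("corrupted frames: " + corrupted.get());
            }
        }

        With a fresh buffer per frame the count stays at zero; with the shared one it climbs quickly - which is the property an end-to-end test would assert under load.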

        Pavel Yaskevich added a comment -

        Brandon Williams I have released 0.3.5 just now, with reallocation and on-heap buffers turned on by default. +1 on the change, so we either commit 0.3.5 or your patch.

        Edward Capriolo Here is a definition of unit testing: in my tests I cover single/multi connection, on-heap/off-heap, +/- reallocation scenarios, basically everything to prove that the server functions properly in all of the modes and returns correct results based on the operation being used. End-to-end tests are what systems which integrate the project are supposed to do, and that is done by stress and bpdlab testing. If you have been following the discussion in this ticket you must have already realized that the problem is not caused by the HsHa server directly, but rather by the fact that Cassandra holds Thrift buffers even after the blocking evaluation is finished.

        Pavel Yaskevich added a comment -

        Brandon Williams 0.3.5 is already available on maven central so act as you think appropriate.

        Edward Capriolo added a comment -

        @Pavel I understand what you are saying. I understand what a unit test is. Whenever I submit patches to the ASF they come with tests. ::cough:: ::cough::. In any case, what I was saying is that the external dependency does not do load testing. Cassandra does not default to hsha. I DO NOT see any reference in this conversation as to how exactly the HSHA server is now load/correctness tested. If such a test exists, great; if not, one potentially should be added.

        Edward Capriolo added a comment -

        Also while we are on the topic.

        # The default is sync because on Windows hsha is about 30% slower. On Linux,
        # sync/hsha performance is about the same, with hsha of course using less memory.
        #
        # Alternatively, can provide your own RPC server by providing the fully-qualified class name
        # of an o.a.c.t.TServerFactory that can create an instance of it.
        rpc_server_type: sync

        The logic behind this default confuses me. The vast majority of the cassandra user base is on linux. We chose 'sync' so the uncommon case is not slowed down. Clearly anyone using linux should switch to hsha because it uses less memory and is wicked fast according to the github tests. But if it's not the default, does it really get performance/correctness evaluated in any meaningful way?

        Benedict added a comment -

        For many workloads sync is faster than async on linux also (by a significant margin), so perhaps the docs should be updated.

        Rick Branson added a comment -

        I think what might help this specific quality issue is moving entirely to the new HSHA implementation in a later version and removing the choice. The new HSHA supposedly eliminates the performance issues that made it a poor default choice, so there appears to be no advantage to keeping the other choices.

        Brandon Williams added a comment -

        For 2.1, I can get behind that I think, especially calling it 'disruptor' or pretty much anything besides 'HSHA.' For 2.0 though it's hard to swallow in a minor.

        Rick Branson added a comment -

        I did some more digging around on our cluster that was running 2.0.6 when it saw the corruption: it took anywhere from a few hours to 48 hours for the first compaction with the out-of-order key exception to throw. These nodes are receiving thousands of writes per second, so it's not going to be trivially reproducible. We've been running one of the nodes with 2.0.8-tentative + enable_reallocate_buffers.txt and will report back once we've reached 72 hours and are comfortable rolling this out wide to our own clusters.

        Rick Branson added a comment -

        Haven't been able to repro in over 5 days. We're considering the enable_reallocate_buffers.txt patch to be the fix, and production-ready.

        Brandon Williams added a comment - - edited

        I committed this patch to 2.0, but did not update the disruptor jar for fear of any further regressions, so the patch Rick tested is in there. For 2.1, I committed both this patch and disruptor 0.3.5.

        Brandon Williams added a comment -

        Oops, wait, I only changed the maven dependency. Mikhail Stepura could you clean up the 2.1+ side of things?

        Mikhail Stepura added a comment -

        Brandon Williams done.
        Brandon Williams added a comment -

        Thanks.

        Alexander Sterligov added a comment - - edited

        It looks like this is not fixed in 2.1.0. We have cassandra under heavy load through the binary interface, and only OpsCenter uses thrift. The OpsCenter rollups are corrupted within about an hour after a scrub.

        ERROR [CompactionExecutor:71] 2014-10-21 22:16:39,950 CassandraDaemon.java:166 - Exception in thread Thread[CompactionExecutor:71,1,main]
        java.lang.RuntimeException: Last written key DecoratedKey(-7581200918995348250, 39352e3130382e3234322e32302d6973732d73686172645f696e666f2d676574426c6f6f6d46696c74657246616c7365506f73697469766573) >= current key DecoratedKey(-8301289422298317140, 800100010000000c62617463685f6d75746174650006d04a0d00010b0d0000000100000025) writing into /ssd/cassandra/data/OpsCenter/rollups60/OpsCenter-rollups60-tmp-ka-9128-Data.db
        at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:172) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:196) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableRewriter.append(SSTableRewriter.java:110) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:177) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:74) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:235) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) ~[na:1.7.0_51]
        at java.util.concurrent.FutureTask.run(FutureTask.java:262) ~[na:1.7.0_51]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) ~[na:1.7.0_51]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_51]
        at java.lang.Thread.run(Thread.java:744) [na:1.7.0_51]

        We'll try to switch to sync and see what happens.

        Is it possible that streaming hangs because of that exception? Is it possible that this exception affects minor compactions of other keyspaces?

        Nikolai Grigoriev added a comment -

        Alexander Sterligov I was always wondering why I always saw these problems appearing in the OpsCenter keyspace. My keyspace had much more traffic, but when I had this problem it always manifested itself in the OpsCenter keyspace, even when I was also using Thrift (we use the native protocol now).

        I even remember disabling OpsCenter to prove the point.

        Alexander Sterligov added a comment - - edited

        Have you proven that it's really related to OpsCenter?

        We've switched to "sync", but we still get corrupted sstables. Now we get the exception not during compaction, but at startup:

        ERROR [SSTableBatchOpen:10] 2014-10-22 02:47:48,762 CassandraDaemon.java:166 - Exception in thread Thread[SSTableBatchOpen:10,5,main]
        java.lang.IllegalStateException: SSTable first key DecoratedKey(4206305143314087741, 800100010000000c62617463685f6d7574617465000010250d00010b0d000000010000004e33372e3134302e3134312e3231322d6973732d736c6f745f636f6e66696775726174696f6e5f746172) > last key DecoratedKey(-4632241097675266745, 800100010000000c62617463685f6d7574617465000010260d00010b0d000000010000005133372e3134302e3134312e3231322d6973732d736c6f745f636f6e66696775726174696f6e5f746172676574)
        at org.apache.cassandra.io.sstable.SSTableReader.validate(SSTableReader.java:1083) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:398) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:294) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableReader$4.run(SSTableReader.java:430) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) ~[na:1.7.0_51]
        at java.util.concurrent.FutureTask.run(FutureTask.java:262) ~[na:1.7.0_51]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) ~[na:1.7.0_51]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_51]
        at java.lang.Thread.run(Thread.java:744) [na:1.7.0_51]

        And nodetool scrub doesn't help: it finds no errors, and after a restart we get the same exceptions.

        Nikolai Grigoriev added a comment -

        I think this is the kind of error that you cannot fix by scrubbing: a corrupted sstable. I was fixing those by deleting the sstables and doing repairs. Unfortunately, if that happens on many nodes there is a risk of data loss.

        As for OpsCenter - do not get me wrong, I did not want to say that OpsCenter was directly responsible for these troubles. But I do believe that OpsCenter does something particular that reveals the bug in the hsha server. At least this was my impression. After disabling OpsCenter and fixing the outstanding problems I do not recall seeing those errors anymore. And I was also using Thrift, and I was writing and reading 100x more data than OpsCenter.

        Pavel Yaskevich added a comment -

        Alexander Sterligov Did you get any WARN messages like "N out of order rows found while scrubbing <file>; Those have been written (in order) to a new sstable <new-file>" while running scrub? Anyhow, you will have to delete the affected files and repair from the neighbors. I'm also not sure how much involvement Thrift has in this, because the only thing that could go wrong (shared buffers) was already fixed so buffers are copied for every request, and everything is allocated on-heap...

        Rick Branson Are you running HsHa with 2.1 or still on 2.0?

        Alexander Sterligov added a comment - - edited

        Pavel Yaskevich No, I've not seen such messages. sstablescrub failed with an NPE. The sstables were corrupted on all 17 nodes. I removed them manually and there were no errors overnight. It seems sync really affected the problem. Maybe there is another problem which is hidden with the sync server. I still have problems - validation hangs on one table on all nodes.

        Nikolai Grigoriev added a comment -

        By the way, I am getting

        ERROR [CompactionExecutor:2333] 2014-10-23 18:29:53,590 CassandraDaemon.java (line 199) Exception in thread Thread[CompactionExecutor:2333,1,main]
        java.lang.RuntimeException: Last written key DecoratedKey(1156541975678546868, 001000000000111100000000000003bc510f0000100000000003bc510f00000000111100000000100000000000004000000000000000000100) >= current key DecoratedKey(36735936098318717, 0010000000001111000000000000015feb8a00001000000000015feb8a00000000111100000000100000000000004000000000000000000100) writing into /cassandra-data/disk2/myks/mytable/myks-mytable-tmp-jb-94445-Data.db
                at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:142)
                at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:165)
                at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:160)
                at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48)
                at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
                at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:60)
                at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59)
                at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:198)
                at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
                at java.util.concurrent.FutureTask.run(FutureTask.java:262)
                at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
                at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
                at java.lang.Thread.run(Thread.java:745)
        

        with the 2.0.10 release. I am using the native protocol. I believe the native protocol handler is based on HSHA, am I right? Anyway, I am getting those too.

        Pavel Yaskevich added a comment -

        Nikolai Grigoriev No, the native protocol is not using Thrift, which further confirms that it's a cross-transport problem; I think we should create a separate ticket to handle it.
        Alexander Sterligov Do you still have the stacktrace for the NPE you've got while scrubbing?

        Alexander Sterligov added a comment - - edited

        Pavel Yaskevich That NPE happened once and unfortunately I did not save it. If I get it once more I'll save the sstable.
        I totally removed the OpsCenter keyspace (with sstables) and recreated it. I don't get "Last written key DecoratedKey" any more. By the way, this error definitely causes streams to hang at 100%.

        I have several strange things happening now:

        • I've noticed that it takes about 30 minutes between "nodetool repair" and the first pending AntiEntropySession. Is that ok?
        • Repair has already been running for 24 hours (~13GB per node, 17 nodes). What's the number of AntiEntropySessions needed to finish a single repair? The number of key ranges?

          Pool Name Active Pending Completed Blocked All time blocked
          CounterMutationStage 0 0 0 0 0
          ReadStage 0 0 392196 0 0
          RequestResponseStage 0 0 5271906 0 0
          MutationStage 0 0 19832506 0 0
          ReadRepairStage 0 0 2280 0 0
          GossipStage 0 0 453830 0 0
          CacheCleanupExecutor 0 0 0 0 0
          MigrationStage 0 0 0 0 0
          ValidationExecutor 0 0 39446 0 0
          MemtableReclaimMemory 0 0 29927 0 0
          InternalResponseStage 0 0 588279 0 0
          AntiEntropyStage 0 0 5325285 0 0
          MiscStage 0 0 0 0 0
          CommitLogArchiver 0 0 0 0 0
          MemtableFlushWriter 0 0 29927 0 0
          PendingRangeCalculator 0 0 30 0 0
          MemtablePostFlush 0 0 135734 0 0
          CompactionExecutor 31 31 502175 0 0
          AntiEntropySessions 3 3 3446 0 0
          HintedHandoff 0 0 44 0 0

          Message type Dropped
          RANGE_SLICE 0
          READ_REPAIR 0
          PAGED_RANGE 0
          BINARY 0
          READ 0
          MUTATION 2
          _TRACE 0
          REQUEST_RESPONSE 0
          COUNTER_MUTATION 0

        • Some validation compactions report more than 100% progress (1923%). I think that it's CASSANDRA-7239, right?
        • The number of sstables for some CFs is about 15 000 and continues to grow during repair.
        • There are several exceptions like the following during repair:

          ERROR [RepairJobTask:80] 2014-10-24 13:27:31,717 RepairJob.java:127 - Error occurred during snapshot phase
          java.lang.RuntimeException: Could not create snapshot at /37.140.189.163
          at org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77) ~[apache-cassandra-2.1.0.jar:2.1.0]
          at org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) ~[apache-cassandra-2.1.0.jar:2.1.0]
          at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) ~[na:1.7.0_51]
          at java.util.concurrent.FutureTask.run(FutureTask.java:262) ~[na:1.7.0_51]
          at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_51]
          at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_51]
          at java.lang.Thread.run(Thread.java:744) [na:1.7.0_51]
          ERROR [AntiEntropySessions:141] 2014-10-24 13:27:31,724 RepairSession.java:303 - repair #da2cb020-5b5f-11e4-a45e-d9cec1206f33 session completed with the following error
          java.io.IOException: Failed during snapshot creation.
          at org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344) ~[apache-cassandra-2.1.0.jar:2.1.0]
          at org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) ~[apache-cassandra-2.1.0.jar:2.1.0]
          at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) ~[guava-16.0.jar:na]
          at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_51]
          at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_51]
          at java.lang.Thread.run(Thread.java:744) [na:1.7.0_51]
          ERROR [AntiEntropySessions:141] 2014-10-24 13:27:31,724 CassandraDaemon.java:166 - Exception in thread Thread[AntiEntropySessions:141,5,RMI Runtime]
          java.lang.RuntimeException: java.io.IOException: Failed during snapshot creation.
          at com.google.common.base.Throwables.propagate(Throwables.java:160) ~[guava-16.0.jar:na]
          at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:32) ~[apache-cassandra-2.1.0.jar:2.1.0]
          at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) ~[na:1.7.0_51]
          at java.util.concurrent.FutureTask.run(FutureTask.java:262) ~[na:1.7.0_51]
          at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) ~[na:1.7.0_51]
          at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_51]
          at java.lang.Thread.run(Thread.java:744) [na:1.7.0_51]
          Caused by: java.io.IOException: Failed during snapshot creation.
          at org.apache.cassandra.repair.RepairSession.failedSnapshot(RepairSession.java:344) ~[apache-cassandra-2.1.0.jar:2.1.0]
          at org.apache.cassandra.repair.RepairJob$2.onFailure(RepairJob.java:128) ~[apache-cassandra-2.1.0.jar:2.1.0]
          at com.google.common.util.concurrent.Futures$4.run(Futures.java:1172) ~[guava-16.0.jar:na]
          ... 3 common frames omitted

        Marcus Eriksson added a comment -

        I think the cause of the latest exceptions in this ticket is CASSANDRA-8211

        Randy Fradin added a comment - - edited

        I am getting this exception using Thrift HSHA in 2.1.0:

        INFO [CompactionExecutor:8] 2015-01-26 13:32:51,818 CompactionTask.java (line 138) Compacting [SSTableReader(path='/tmp/cass_test/cassandra/TestCassandra/data/test_ks/test_cf-1c45da40a58911e4826751fbbc77b187/test_ks-test_cf-ka-2-Data.db'), SSTableReader(path='/tmp/cass_test/cassandra/TestCassandra/data/test_ks/test_cf-1c45da40a58911e4826751fbbc77b187/test_ks-test_cf-ka-1-Data.db')]
        INFO [CompactionExecutor:8] 2015-01-26 13:32:51,890 ColumnFamilyStore.java (line 856) Enqueuing flush of compactions_in_progress: 212 (0%) on-heap, 20 (0%) off-heap
        INFO [MemtableFlushWriter:8] 2015-01-26 13:32:51,892 Memtable.java (line 326) Writing Memtable-compactions_in_progress@1155018639(0 serialized bytes, 1 ops, 0%/0% of on/off-heap limit)
        INFO [MemtableFlushWriter:8] 2015-01-26 13:32:51,896 Memtable.java (line 360) Completed flushing /tmp/cass_test/cassandra/TestCassandra/data/system/compactions_in_progress-55080ab05d9c388690a4acb25fe1f77b/system-compactions_in_progress-ka-2-Data.db (42 bytes) for commitlog position ReplayPosition(segmentId=1422296630707, position=430226)
        ERROR [CompactionExecutor:8] 2015-01-26 13:32:51,906 CassandraDaemon.java (line 166) Exception in thread Thread[CompactionExecutor:8,1,RMI Runtime]
        java.lang.RuntimeException: Last written key DecoratedKey(131206587314004820534098544948237170809, 800100010000000c62617463685f6d7574617465000000) >= current key DecoratedKey(14775611966645399672119169777260659240, 726f776b65793030385f31343232323937313537353835) writing into /tmp/cass_test/cassandra/TestCassandra/data/test_ks/test_cf-1c45da40a58911e4826751fbbc77b187/test_ks-test_cf-tmp-ka-3-Data.db
        at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:172) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:196) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableRewriter.append(SSTableRewriter.java:110) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionTask.runWith(CompactionTask.java:177) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.util.DiskAwareRunnable.runMayThrow(DiskAwareRunnable.java:48) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionTask.executeInternal(CompactionTask.java:74) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.AbstractCompactionTask.execute(AbstractCompactionTask.java:59) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionManager$BackgroundCompactionTask.run(CompactionManager.java:235) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) ~[na:1.7.0_40]
        at java.util.concurrent.FutureTask.run(FutureTask.java:262) ~[na:1.7.0_40]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) ~[na:1.7.0_40]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_40]
        at java.lang.Thread.run(Thread.java:724) [na:1.7.0_40]

        I don't think it's caused by CASSANDRA-8211, because it happens during the first compaction that takes place between the first 2 SSTables to get flushed from an initially empty column family.

        Also, I've only been able to reproduce it when using both hsha for the rpc server and offheap_objects for memtable allocation. If I switch either to sync or to offheap_buffers or heap_buffers, then I cannot reproduce the problem. Also, under the same circumstances I'm pretty sure I've seen incorrect data being returned for a client multiget_slice request before any SSTables had been flushed yet, so I presume this corruption happens before any flush/compaction takes place.
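
        For reference, the combination above corresponds to these cassandra.yaml settings (2.1 option names):

        rpc_server_type: hsha
        memtable_allocation_type: offheap_objects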

        nodetool scrub yielded these errors:

        INFO [CompactionExecutor:9] 2015-01-26 13:48:01,512 OutputHandler.java (line 42) Scrubbing SSTableReader(path='/tmp/cass_test/cassandra/TestCassandra/data/test_ks/test_cf-1c45da40a58911e4826751fbbc77b187/test_ks-test_cf-ka-2-Data.db') (168780 bytes)
        INFO [CompactionExecutor:10] 2015-01-26 13:48:01,512 OutputHandler.java (line 42) Scrubbing SSTableReader(path='/tmp/cass_test/cassandra/TestCassandra/data/test_ks/test_cf-1c45da40a58911e4826751fbbc77b187/test_ks-test_cf-ka-1-Data.db') (135024 bytes)
        WARN [CompactionExecutor:9] 2015-01-26 13:48:01,531 OutputHandler.java (line 52) Out of order row detected (DecoratedKey(14775611966645399672119169777260659240, 726f776b65793030385f31343232323937313537353835) found after DecoratedKey(131206587314004820534098544948237170809, 800100010000000c62617463685f6d7574617465000000))
        WARN [CompactionExecutor:9] 2015-01-26 13:48:01,534 OutputHandler.java (line 57) Error reading row (stacktrace follows):
        java.lang.RuntimeException: Last written key DecoratedKey(131206587314004820534098544948237170809, 800100010000000c62617463685f6d7574617465000000) >= current key DecoratedKey(131206587314004820534098544948237170809, 800100010000000c62617463685f6d7574617465000000) writing into /tmp/cass_test/cassandra/TestCassandra/data/test_ks/test_cf-1c45da40a58911e4826751fbbc77b187/test_ks-test_cf-tmp-ka-4-Data.db
        at org.apache.cassandra.io.sstable.SSTableWriter.beforeAppend(SSTableWriter.java:172) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableWriter.append(SSTableWriter.java:196) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableRewriter.append(SSTableRewriter.java:110) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.io.sstable.SSTableRewriter.tryAppend(SSTableRewriter.java:141) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.Scrubber.scrub(Scrubber.java:186) ~[apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionManager.scrubOne(CompactionManager.java:592) [apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionManager.access$300(CompactionManager.java:100) [apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionManager$3.execute(CompactionManager.java:315) [apache-cassandra-2.1.0.jar:2.1.0]
        at org.apache.cassandra.db.compaction.CompactionManager$2.call(CompactionManager.java:270) [apache-cassandra-2.1.0.jar:2.1.0]
        at java.util.concurrent.FutureTask.run(FutureTask.java:262) [na:1.7.0_40]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_40]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_40]
        at java.lang.Thread.run(Thread.java:724) [na:1.7.0_40]
        WARN [CompactionExecutor:9] 2015-01-26 13:48:01,534 OutputHandler.java (line 52) Row starting at position 25342 is unreadable; skipping to next
        WARN [CompactionExecutor:10] 2015-01-26 13:48:01,534 OutputHandler.java (line 52) Out of order row detected (DecoratedKey(29459452031265566667651334397450214244, 726f776b65793030355f31343232323936393033323837) found after DecoratedKey(131206587314004820534098544948237170809, 800100010000000c62617463685f6d7574617465000000))

        etc...
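
        The scrub invocation itself isn't shown above; for a single table it would typically be run as follows, with the keyspace and table names taken from the SSTable paths in the log:

        nodetool scrub test_ks test_cf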

        EDIT: I copied my comment to a new issue (CASSANDRA-8719) since this issue has long been closed

          People

          • Assignee: Pavel Yaskevich
          • Reporter: David Sauer
          • Reviewer: Brandon Williams
          • Tester: Rick Branson
          • Votes: 4
          • Watchers: 34
