[CASSANDRA-14812] Multiget Thrift query returns null records after digest mismatch - ASF JIRA

Agile Board

Attach files

Attach Screenshot

Bulk Copy Attachments

Bulk Move Attachments

Voters

Watch issue

Watchers

Create sub-task

Convert to sub-task

Move

Link

Clone

Labels

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Urgent
Resolution: Fixed
Fix Version/s: 3.0.19, 3.11.5, 4.0-alpha1, 4.0
Component/s: Consistency/Coordination, Messaging/Thrift
Labels:
None

Bug Category:
Correctness - Transient Incorrect Response
Severity:
Critical
Complexity:
Challenging
Discovered By:
User Report
Since Version:

3.0.0
Source Control Link:

97eae441dab742f0eaffcedc360991350232cfd6

Description

It seems that in Cassandra 3.0.0 a nasty bug was introduced in multiget Thrift query processing logic. When one tries to read data from several partitions with a single multiget query and DigestMismatch exception is raised during this query processing, request coordinator prematurely terminates response stream right at the point where the first DigestMismatch error is occurring. This leads to situation where clients "do not see" some data contained in the database.

We managed to reproduce this bug in all versions of Cassandra starting with v3.0.0. The pre-release version 3.0.0-rc2 works correctly. It looks like refactoring of iterator transformation hierarchy related to ~~CASSANDRA-9975~~ triggers incorrect behaviour.

When concatenated iterator is returned from the StorageProxy.fetchRows(...), Cassandra starts to consume this combined iterator. Because of DigestMismatch exception some elements of this combined iterator contain additional ThriftCounter, that was added during DataResolver.resolve(...) execution. While consuming iterator for many partitions Cassandra calls BaseIterator.tryGetMoreContents(...) method that must switch from one partition iterator to another in case of exhaustion of the former. In this case all Transformations contained in the next iterator are applied to the combined BaseIterator that enumerates partitions sequence which is wrong. This behaviour causes BaseIterator to stop enumeration after it fully consumes partition with DigestMismatch error, because this partition iterator has additional ThriftCounter data limit.

The attachment contains the python2 script small_repro_script.py that reproduces this bug within 3-nodes ccmlib controlled cluster. Also, there is an extended version of this script - repro_script.py - that contains more logging information and provides the ability to test behavior for many Cassandra versions (to run all test cases from repro_script.py you can call python -m unittest2 -v repro_script.ThriftMultigetTestCase). All the necessary dependencies contained in the requirements.txt

This bug is critical in our production environment because we can't permit any data skip.

Any ideas about a patch for this issue?

Attachments

small_repro_script.py
10/Oct/18 19:14
1 kB
Sivukhin Nikita
repro_script.py
10/Oct/18 19:16
5 kB
Sivukhin Nikita
requirements.txt
10/Oct/18 19:17
0.1 kB
Sivukhin Nikita
small_repro_script_cql.py
27/May/19 12:00
2 kB
Michael Semb Wever

Issue Links

Add Link

is duplicated by

CASSANDRA-14918 multiget_slice returning inconsistent results when performed with CL higher than ONE

Resolved

Delete this link

is related to

CASSANDRA-14918 multiget_slice returning inconsistent results when performed with CL higher than ONE

Resolved

Delete this link

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: Benedict Elliott Smith Assign to me

Reporter:: Sivukhin Nikita

Authors:: Benedict Elliott Smith

Reviewers:: Michael Semb Wever

Votes:: 3 Vote for this issue

Watchers:: 11 Start watching this issue

Dates

Created:: 10/Oct/18 19:31

Updated:: 15/May/20 08:05

Resolved:: 02/Jul/19 10:22

Agile

View on Board

Multiget Thrift query returns null records after digest mismatch

Details

Description

Attachments

Attachments

Issue Links

Activity

People

Dates

Agile

Slack

Issue deployment