[CASSANDRA-2855] Skip rows with empty columns when slicing entire row - ASF JIRA

Agile Board

Attach files

Attach Screenshot

Bulk Copy Attachments

Bulk Move Attachments

Voters

Watch issue

Watchers

Create sub-task

Convert to sub-task

Move

Link

Clone

Labels

Update Comment Author

Replace String in Comment

Update Comment Visibility

Delete Comments

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Low
Resolution: Fixed
Fix Version/s: 0.8.8
Component/s: Legacy/CQL
Labels:
- hadoop

Description

We have been finding that range ghosts appear in results from Hadoop via Pig. This could also happen if rows don't have data for the slice predicate that is given. This leads to having to do a painful amount of defensive checking on the Pig side, especially in the case of range ghosts.

We would like to add an option to skip rows that have no column values in it. That functionality existed before in core Cassandra but was removed because of the performance penalty of that checking. However with Hadoop support in the RecordReader, that is batch oriented anyway, so individual row reading performance isn't as much of an issue. Also we would make it an optional config parameter for each job anyway, so people wouldn't have to incur that penalty if they are confident that there won't be those empty rows or they don't care.

It could be parameter cassandra.skip.empty.rows and be true/false.

Attachments

2855-v2.txt
26/Jul/11 21:10
1 kB
Jeremy Hanna
2855-v3.txt
28/Jul/11 22:12
3 kB
Jeremy Hanna
2855-v4.txt
03/Aug/11 23:20
4 kB
Jeremy Hanna
2855-v5.txt
24/Aug/11 16:53
3 kB
Brandon Williams
ASF.LICENSE.NOT.GRANTED--v1-0001-CASSANDRA-2855-ignore-ghosts-when-no-predicate-specifi.txt
08/Nov/11 20:47
5 kB
T Jake Luciani

Activity

Comment

This comment will be Viewable by All Users Viewable by All Users

Cancel

People

Assignee:: T Jake Luciani Assign to me

Reporter:: Jeremy Hanna

Authors:: T Jake Luciani

Reviewers:: Brandon Williams

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 04/Jul/11 15:54

Updated:: 16/Apr/19 09:32

Resolved:: 10/Nov/11 18:38

Agile

View on Board

Skip rows with empty columns when slicing entire row

Details

Description

Attachments

Attachments

Activity

People

Dates

Agile

Slack

Issue deployment