CASSANDRA-9264

Cassandra should not persist files without checksums


Details

    • Type: Improvement
    • Status: Open
    • Priority: Normal
    • Resolution: Unresolved
    • Fix Version/s: 5.x
    • Component/s: Legacy/Core
    • Labels: None

    Description

      Even if checksums aren't validated on every read, it is helpful to persist files with checksums so that, when a corrupted file is encountered, you can at least confirm that the issue is corruption and not an application-level error that generated a corrupt file.

      We should standardize on conventions for how to checksum a file and on which checksums to use, so that we get the best possible performance.

      For a small checksum, I think we should use CRC32 because hardware support for it appears quite good.
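
      As a rough sketch of what that could look like on the write path, the JDK's java.util.zip.CRC32 (which HotSpot can intrinsify to hardware CRC instructions on supporting CPUs) can be driven over a file's bytes. The class name and buffer size below are illustrative only:

      import java.io.IOException;
      import java.io.InputStream;
      import java.nio.file.Files;
      import java.nio.file.Path;
      import java.util.zip.CRC32;

      public final class Crc32Example
      {
          // Computes a CRC32 over the full contents of a file, reading in 64 KiB chunks.
          public static long crc32Of(Path file) throws IOException
          {
              CRC32 crc = new CRC32();
              byte[] buffer = new byte[64 * 1024];
              try (InputStream in = Files.newInputStream(file))
              {
                  int read;
                  while ((read = in.read(buffer)) != -1)
                      crc.update(buffer, 0, read);
              }
              return crc.getValue(); // 32-bit checksum in the low bits of the long
          }
      }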

      For cases where a 4-byte checksum is not enough, I think we can look at either xxhash64 or MurmurHash3.

      The problem with xxhash64 is that the output is only 8 bytes. The problem with MurmurHash3 is that the Java implementation is slow. If we can live with 8 bytes and make it easy to switch hash implementations, I think xxhash64 is a good choice because we already ship a good implementation with LZ4.
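
      Since lz4-java is already on the classpath, its net.jpountz.xxhash package is one place such an implementation could come from. A minimal sketch, assuming a streaming hash fed as bytes are written; the seed of 0 and the method name are illustrative:

      import net.jpountz.xxhash.StreamingXXHash64;
      import net.jpountz.xxhash.XXHashFactory;

      public final class XxHash64Example
      {
          private static final long SEED = 0; // illustrative; a real on-disk format would pin this constant

          // Feeds data to a streaming xxhash64, the way bytes would arrive while writing a file,
          // and returns the 8-byte digest as a long.
          public static long xxhash64(byte[]... chunks)
          {
              XXHashFactory factory = XXHashFactory.fastestInstance();
              StreamingXXHash64 hash = factory.newStreamingHash64(SEED);
              for (byte[] chunk : chunks)
                  hash.update(chunk, 0, chunk.length);
              return hash.getValue();
          }
      }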

      I would also like to see hashes always prefixed by a type so that we can swap hash implementations without the pain of trying to figure out which one is present. I would also like to avoid making assumptions about the number of bytes in a hash field where possible, keeping compatibility and space issues in mind.
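
      To illustrate the type-prefix idea, a hash field could carry an algorithm identifier and a length ahead of the digest bytes, so readers can recognize, or at least skip over, digests from implementations they don't know. The identifiers below are hypothetical, not an existing Cassandra format:

      import java.nio.ByteBuffer;

      public final class HashFieldExample
      {
          // Hypothetical algorithm identifiers.
          public static final byte TYPE_CRC32 = 1;
          public static final byte TYPE_XXHASH64 = 2;

          // Encodes [type][digest length][digest bytes] so the hash implementation can be
          // swapped later without readers having to guess what is on disk.
          public static ByteBuffer encode(byte type, byte[] digest)
          {
              ByteBuffer out = ByteBuffer.allocate(2 + digest.length);
              out.put(type);
              out.put((byte) digest.length);
              out.put(digest);
              out.flip();
              return out;
          }
      }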

      Hashing after compression is also preferable to hashing before compression.
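
      The ordering matters because the checksum should cover the bytes that actually hit disk, so corruption can be detected without decompressing anything. A sketch of that ordering, using lz4-java's compressor and CRC32 purely for illustration:

      import java.util.zip.CRC32;

      import net.jpountz.lz4.LZ4Compressor;
      import net.jpountz.lz4.LZ4Factory;

      public final class ChecksumAfterCompressionExample
      {
          // Compresses a block, then checksums the compressed output, i.e. the bytes
          // that will actually be written to disk.
          public static long compressThenChecksum(byte[] uncompressed)
          {
              LZ4Compressor compressor = LZ4Factory.fastestInstance().fastCompressor();
              byte[] compressed = compressor.compress(uncompressed);

              CRC32 crc = new CRC32();
              crc.update(compressed, 0, compressed.length);
              return crc.getValue();
          }
      }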

            People

              Assignee: Unassigned
              Reporter: Ariel Weisberg (aweisberg)
              Votes: 0
              Watchers: 12
