[KAFKA-374] Move to java CRC32 implementation - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 0.8.0
Fix Version/s: None
Component/s: core
Labels:
- newbie

Description

We keep a per-record crc32. This is fairly cheap algorithm, but the java implementation uses JNI and it seems to be a bit expensive for small records. I have seen this before in Kafka profiles, and I noticed it on another application I was working on. Basically with small records the native implementation can only checksum < 100MB/sec. Hadoop has done some analysis of this and replaced it with a Java implementation that is 2x faster for large values and 5-10x faster for small values. Details are here ~~HADOOP-6148~~.

We should do a quick read/write benchmark on log and message set iteration and see if this improves things.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

KAFKA-374.patch
07/Dec/12 18:33
33 kB
David Arthur
KAFKA-374-draft.patch
27/Jun/12 00:01
33 kB
Jay Kreps

Activity

People

Assignee:: Jay Kreps

Reporter:: Jay Kreps

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 26/Jun/12 15:37

Updated:: 19/Jun/14 05:17

Resolved:: 16/Dec/12 19:42