Details
-
Task
-
Status: Closed
-
Major
-
Resolution: Won't Fix
-
None
-
None
-
None
-
None
Description
Profiling 0.98.15 I see 20-30% of CPU time spent in Hadoop's PureJavaCrc32. Not surprising given previous results described on HBASE-11927. Backport.
There are two issues with the backport:
- The patch on 11927 changes the default CRC type from CRC32 to CRC32C. Although the changes are backwards compatible -files with either CRC type will be handled correctly in a transparent manner - we should probably leave the default alone in 0.98 and advise users on a site configuration change to use CRC32C if desired, for potential hardware acceleration.
- Need a shim for differences between Hadoop's DataChecksum type.
Attachments
Attachments
Issue Links
- is related to
-
HBASE-11927 Use Native Hadoop Library for HFile checksum (And flip default from CRC32 to CRC32C)
- Closed