Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-14738

Backport HBASE-11927 (Use Native Hadoop Library for HFile checksum) to 0.98

    XMLWordPrintableJSON

Details

    • Task
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • None
    • None

    Description

      Profiling 0.98.15 I see 20-30% of CPU time spent in Hadoop's PureJavaCrc32. Not surprising given previous results described on HBASE-11927. Backport.

      There are two issues with the backport:

      1. The patch on 11927 changes the default CRC type from CRC32 to CRC32C. Although the changes are backwards compatible -files with either CRC type will be handled correctly in a transparent manner - we should probably leave the default alone in 0.98 and advise users on a site configuration change to use CRC32C if desired, for potential hardware acceleration.
      2. Need a shim for differences between Hadoop's DataChecksum type.

      Attachments

        1. HBASE-14738-0.98.patch
          51 kB
          Andrew Kyle Purtell

        Issue Links

          Activity

            People

              Unassigned Unassigned
              apurtell Andrew Kyle Purtell
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: