Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-12046

Hadoop CRC implementation using Intel ISA-L library

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Intel ISA-L open source library provides set of highly optimized functions for RAID, erasure code, CRC, cryptographic hash, encryption, and compression. Ref. https://github.com/01org/isa-l. HDFS-EC has already integrated ISA-L and added the necessary building options support for Hadoop. For Hadoop CRC, we recently explored more, developing a Hadoop CRC using Intel ISA-L, performing a test on Broadwell and Skylake servers, comparing the performance against Hadoop native CRC. On Broadwell/Skylake, ISA-L CRC has about 8%~ performance gain over Hadoop native CRC. We suggest adding a new Hadoop native CRC using the ISA-L library, the extra advantage is it’s already optimized when we upgrade to new servers and Hadoop developers don’t have to maintain their own bunch of ASM codes.

      Attachments

        1. HDFS-12046-002.patch
          9 kB
          Wei-Chiu Chuang
        2. ISA-L CRC Performance Report using intel ISA-L.pdf
          242 kB
          luhuichun
        3. HDFS-12046-001.patch
          9 kB
          luhuichun

        Activity

          People

            Sammi Sammi Chen
            luhuichun luhuichun
            Votes:
            0 Vote for this issue
            Watchers:
            10 Start watching this issue

            Dates

              Created:
              Updated: