Hadoop Common / HADOOP-14313

Replace/improve Hadoop's byte[] comparator

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • 2.7.4
    • 3.2.0
    • common
    • None
    • Reviewed

    Description

      Hi,
      We recently looked at lexicographic byte array comparison in HBase. We microbenchmarked the byte array comparators of Hadoop ( https://github.com/hanborq/hadoop/blob/master/src/core/org/apache/hadoop/io/FastByteComparisons.java#L161 ) and HBase against the latest byte array comparator from Guava ( https://github.com/google/guava/blob/master/guava/src/com/google/common/primitives/UnsignedBytes.java#L362 ) and observed that the Guava main-branch version is much faster.
      Specifically, we see a very good improvement when byteArraySize % 8 != 0 and also for large byte arrays. I will update this issue with JMH benchmark results for Hadoop vs. Guava. For the corresponding HBase JIRA, please refer to HBASE-17877.
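
      For readers unfamiliar with the technique, the following is a minimal sketch of lexicographic comparison of unsigned byte arrays, processing eight bytes at a time and falling back to a byte-by-byte loop for the remainder. It is not the Hadoop, HBase, or Guava implementation (those linked above use sun.misc.Unsafe); the class and method names are hypothetical, and ByteBuffer is used here only to keep the example portable. It illustrates why the case byteArraySize % 8 != 0 matters: the leftover bytes take a separate path.

```java
import java.nio.ByteBuffer;

// Hypothetical illustration only; not the code from the attached patches.
final class LexicographicCompareSketch {

    // Baseline: compare one unsigned byte at a time.
    static int compareBytewise(byte[] a, byte[] b) {
        int min = Math.min(a.length, b.length);
        for (int i = 0; i < min; i++) {
            int diff = (a[i] & 0xff) - (b[i] & 0xff);
            if (diff != 0) {
                return diff;
            }
        }
        // All shared bytes are equal, so the shorter array sorts first.
        return a.length - b.length;
    }

    // Word-at-a-time: compare 8 bytes per step as big-endian unsigned longs,
    // then finish the remaining (min % 8) bytes one at a time.
    static int compareWordwise(byte[] a, byte[] b) {
        int min = Math.min(a.length, b.length);
        int wordEnd = min & ~7; // largest multiple of 8 that is <= min
        ByteBuffer bufA = ByteBuffer.wrap(a); // ByteBuffer is big-endian by default
        ByteBuffer bufB = ByteBuffer.wrap(b);

        int i = 0;
        for (; i < wordEnd; i += 8) {
            long wordA = bufA.getLong(i);
            long wordB = bufB.getLong(i);
            if (wordA != wordB) {
                // Big-endian order puts the first byte in the most significant
                // position, so an unsigned long comparison matches
                // lexicographic order over these 8 bytes.
                return Long.compareUnsigned(wordA, wordB);
            }
        }
        // Tail: at most 7 bytes left when min is not a multiple of 8.
        for (; i < min; i++) {
            int diff = (a[i] & 0xff) - (b[i] & 0xff);
            if (diff != 0) {
                return diff;
            }
        }
        return a.length - b.length;
    }
}
```

      Both methods return a negative, zero, or positive value, so only the sign of the result should be relied on. A benchmark along the lines described above would compare such variants for array sizes that are and are not multiples of 8.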

      Attachments

        1. HADOOP-14313.branch-2.7.001.patch
          3 kB
          Vikas Vishwakarma
        2. HADOOP-14313.branch-2.7.002.patch
          4 kB
          Vikas Vishwakarma
        3. HADOOP-14313.001.patch
          4 kB
          Vikas Vishwakarma

          People

            vik.karma Vikas Vishwakarma
            vik.karma Vikas Vishwakarma
            Votes: 0
            Watchers: 14