Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-14313

Replace/improve Hadoop's byte[] comparator

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.7.4
    • Fix Version/s: 3.2.0
    • Component/s: common
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      Hi,
      Recently we were looking at the Lexicographic byte array comparison in HBase. We did microbenchmark for the byte array comparator of HADOOP ( https://github.com/hanborq/hadoop/blob/master/src/core/org/apache/hadoop/io/FastByteComparisons.java#L161 ) , HBase Vs the latest byte array comparator from guava ( https://github.com/google/guava/blob/master/guava/src/com/google/common/primitives/UnsignedBytes.java#L362 ) and observed that the guava main branch version is much faster.
      Specifically we see very good improvement when the byteArraySize%8 != 0 and also for large byte arrays. I will update the benchmark results using JMH for Hadoop vs Guava. For the jira on HBase, please refer HBASE-17877.

        Attachments

        1. HADOOP-14313.001.patch
          4 kB
          Vikas Vishwakarma
        2. HADOOP-14313.branch-2.7.001.patch
          3 kB
          Vikas Vishwakarma
        3. HADOOP-14313.branch-2.7.002.patch
          4 kB
          Vikas Vishwakarma

          Activity

            People

            • Assignee:
              vik.karma Vikas Vishwakarma
              Reporter:
              vik.karma Vikas Vishwakarma
            • Votes:
              0 Vote for this issue
              Watchers:
              14 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: