[HADOOP-14313] Replace/improve Hadoop's byte[] comparator - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.7.4
Fix Version/s: 3.2.0
Component/s: common
Labels: None

Target Version/s: 2.7.8
Hadoop Flags: Reviewed

Description

Hi,
Recently we were looking at the lexicographic byte array comparison in HBase. We microbenchmarked the byte array comparators of Hadoop ( https://github.com/hanborq/hadoop/blob/master/src/core/org/apache/hadoop/io/FastByteComparisons.java#L161 ) and HBase against the latest byte array comparator from Guava ( https://github.com/google/guava/blob/master/guava/src/com/google/common/primitives/UnsignedBytes.java#L362 ) and observed that the version on Guava's main branch is much faster.
Specifically, we see a very good improvement when byteArraySize % 8 != 0, and also for large byte arrays. I will update the benchmark results using JMH for Hadoop vs. Guava. For the corresponding JIRA on HBase, please refer to HBASE-17877.
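
For readers unfamiliar with the technique being benchmarked, here is a minimal sketch of unsigned lexicographic comparison done one byte at a time and eight bytes at a time. It is not the Hadoop, HBase, or Guava code: the linked implementations read memory through `sun.misc.Unsafe`, whereas this sketch uses `java.nio.ByteBuffer` to stay portable. The class and method names (`LexByteCompare`, `compareBytewise`, `compareWordwise`) are made up for illustration.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical illustration class; not part of Hadoop, HBase, or Guava.
final class LexByteCompare {
    private LexByteCompare() {}

    /** Baseline: compare one unsigned byte per iteration. */
    public static int compareBytewise(byte[] a, byte[] b) {
        int min = Math.min(a.length, b.length);
        for (int i = 0; i < min; i++) {
            int diff = (a[i] & 0xff) - (b[i] & 0xff);
            if (diff != 0) {
                return diff;
            }
        }
        // Common prefix is equal: the shorter array sorts first.
        return a.length - b.length;
    }

    /** Compare 8 bytes per iteration, then finish the remaining tail bytewise. */
    public static int compareWordwise(byte[] a, byte[] b) {
        int min = Math.min(a.length, b.length);
        int wordBytes = min & ~7; // bytes covered by whole 8-byte words
        // Big-endian reads make unsigned numeric order match lexicographic order.
        ByteBuffer ba = ByteBuffer.wrap(a).order(ByteOrder.BIG_ENDIAN);
        ByteBuffer bb = ByteBuffer.wrap(b).order(ByteOrder.BIG_ENDIAN);
        for (int i = 0; i < wordBytes; i += 8) {
            long la = ba.getLong(i);
            long lb = bb.getLong(i);
            if (la != lb) {
                return Long.compareUnsigned(la, lb);
            }
        }
        // Tail of (min % 8) bytes; this is the byteArraySize % 8 != 0 case.
        for (int i = wordBytes; i < min; i++) {
            int diff = (a[i] & 0xff) - (b[i] & 0xff);
            if (diff != 0) {
                return diff;
            }
        }
        return a.length - b.length;
    }
}
```

Both methods should agree in sign for any pair of inputs; only the amount of work per iteration differs. How the tail is handled is an implementation choice, and the sketch makes no claim about how any of the three benchmarked comparators handle it.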

Attachments

- HADOOP-14313.branch-2.7.001.patch (28/Apr/17 11:56, 3 kB, Vikas Vishwakarma)
- HADOOP-14313.branch-2.7.002.patch (29/Apr/17 05:30, 4 kB, Vikas Vishwakarma)
- HADOOP-14313.001.patch (09/May/17 07:53, 4 kB, Vikas Vishwakarma)

People

Assignee: Vikas Vishwakarma

Reporter: Vikas Vishwakarma

Votes: 0

Watchers: 14

Dates

Created: 17/Apr/17 02:21

Updated: 28/Jun/18 06:24

Resolved: 28/Jun/18 05:59