[HADOOP-10681] Remove synchronized blocks from SnappyCodec and ZlibCodec buffering inner loop - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 2.2.0, 2.4.0, 2.5.0
Fix Version/s: 2.6.0
Component/s: performance
Labels:
- perfomance

Release Note:
Remove unnecessary synchronized blocks from Snappy/Zlib codecs.

Description

The current implementation of SnappyCompressor spends more time within the java loop of copying from the user buffer into the direct buffer allocated to the compressor impl, than the time it takes to compress the buffers.

The bottleneck was found to be java monitor code inside SnappyCompressor.

The methods are neatly inlined by the JIT into the parent caller (BlockCompressorStream::write), which unfortunately does not flatten out the synchronized blocks.

The loop does a write of small byte[] buffers (each IFile key+value).

I counted approximately 6 monitor enter/exit blocks per k-v pair written.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

compress-cmpxchg-small.png
11/Jun/14 17:42
164 kB
Gopal Vijayaraghavan
HADOOP-10681.1.patch
14/Jun/14 02:43
16 kB
Gopal Vijayaraghavan
HADOOP-10681.2.patch
02/Jul/14 07:07
13 kB
Gopal Vijayaraghavan
HADOOP-10681.3.patch
02/Jul/14 19:13
18 kB
Gopal Vijayaraghavan
HADOOP-10681.4.patch
12/Jul/14 21:32
20 kB
Gopal Vijayaraghavan
perf-top-spill-merge.png
11/Jun/14 17:42
126 kB
Gopal Vijayaraghavan
snappy-perf-unsync.png
12/Jun/14 07:11
95 kB
Gopal Vijayaraghavan

Issue Links

is duplicated by

HADOOP-10116 fix "inconsistent synchronization" warnings in ZlibCompressor

Resolved

links to

RB #22602

Activity

People

Assignee:: Gopal Vijayaraghavan

Reporter:: Gopal Vijayaraghavan

Votes:: 0 Vote for this issue

Watchers:: 17 Start watching this issue

Dates

Created:: 11/Jun/14 17:41

Updated:: 08/Sep/16 06:49

Resolved:: 05/Oct/14 14:49