Hadoop Common / HADOOP-14376

Memory leak when reading a compressed file using the native library

    Details

    • Hadoop Flags:
      Reviewed

      Description

      Opening and closing a large number of bzip2-compressed input streams eventually causes the process to be killed with an out-of-memory error when the native bzip2 library is used.

      Our initial analysis suggests that the cause is DecompressorStream overriding the close() method and thereby skipping a line from its parent's close(): CodecPool.returnDecompressor(trackedDecompressor). When the decompressor is a Bzip2Decompressor, its native end() method is never called, and the natively allocated memory is never freed.

      If this analysis is correct, the simplest way to fix this bug would be to replace in.close() with super.close() in DecompressorStream.
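The leak mechanism described above can be sketched without Hadoop itself. This is a minimal illustration, not the actual Hadoop source: the class names NativeDecompressor, TrackingStream, LeakyStream, and FixedStream are hypothetical stand-ins for Bzip2Decompressor, CompressionInputStream, and DecompressorStream, and a counter stands in for native memory.

```java
import java.io.IOException;
import java.util.concurrent.atomic.AtomicInteger;

public class CloseOverrideLeak {
    // Counts simulated native allocations that have not been freed.
    static final AtomicInteger liveNativeAllocations = new AtomicInteger();

    // Stand-in for Bzip2Decompressor: holds "native" memory until end() is called.
    static class NativeDecompressor {
        NativeDecompressor() { liveNativeAllocations.incrementAndGet(); }
        void end() { liveNativeAllocations.decrementAndGet(); }
    }

    // Stand-in for CompressionInputStream: its close() performs the cleanup
    // (analogous to CodecPool.returnDecompressor(trackedDecompressor)).
    static class TrackingStream {
        final NativeDecompressor decompressor = new NativeDecompressor();
        public void close() throws IOException {
            decompressor.end();
        }
    }

    // Stand-in for the buggy DecompressorStream: overrides close() without
    // calling super.close(), so the parent's cleanup is skipped.
    static class LeakyStream extends TrackingStream {
        @Override
        public void close() throws IOException {
            // in.close() only -- decompressor.end() is never reached
        }
    }

    // The proposed fix: delegate to super.close() so the parent's cleanup runs.
    static class FixedStream extends TrackingStream {
        @Override
        public void close() throws IOException {
            super.close();
        }
    }

    public static void main(String[] args) throws IOException {
        for (int i = 0; i < 1000; i++) new LeakyStream().close();
        System.out.println("after leaky closes: " + liveNativeAllocations.get());

        liveNativeAllocations.set(0);
        for (int i = 0; i < 1000; i++) new FixedStream().close();
        System.out.println("after fixed closes: " + liveNativeAllocations.get());
    }
}
```

Running the sketch prints 1000 leaked allocations for the leaky variant and 0 for the fixed one, mirroring why each opened-and-closed bzip2 stream leaks one native decompressor until the process runs out of memory.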

        Attachments

        1. Bzip2MemoryTester.java
          0.8 kB
          Eli Acherkan
        2. log4j.properties
          0.3 kB
          Eli Acherkan
        3. HADOOP-14376.001.patch
          10 kB
          Eli Acherkan
        4. HADOOP-14376.002.patch
          12 kB
          Eli Acherkan
        5. HADOOP-14376.003.patch
          12 kB
          Eli Acherkan
        6. HADOOP-14376.004.patch
          12 kB
          Eli Acherkan


            People

            • Assignee:
              Eli Acherkan (eliac)
            • Reporter:
              Eli Acherkan (eliac)
            • Votes:
              0
            • Watchers:
              8
