[HDFS-2834] ByteBuffer-based read API for DFSInputStream - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.0.2-alpha
Component/s: hdfs-client, performance
Labels: None
Hadoop Flags: Reviewed

Description

The DFSInputStream read path always copies bytes into a JVM-allocated byte[]. For many clients this is the desired behaviour, but in some situations, such as native reads through libhdfs, it imposes an extra copy: the byte[] has to be copied out again into a natively readable memory area.

For these cases, it would be preferable to allow the client to supply its own buffer, wrapped in a ByteBuffer, to avoid that final copy overhead.
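
To illustrate the shape of such an API, here is a minimal, self-contained sketch. It does not use Hadoop: `BufferReadable`, `InMemorySource` and `readFully` are hypothetical names invented for this example, and the in-memory source only stands in for a real stream. The point is that the caller owns the destination buffer (here a direct ByteBuffer, which native code can address without another copy) and the stream writes straight into it.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

public class ByteBufferReadSketch {

    /** Hypothetical stand-in for a ByteBuffer-based read interface (not the Hadoop type). */
    interface BufferReadable {
        /** Reads up to buf.remaining() bytes into buf; returns the count, or -1 at end of stream. */
        int read(ByteBuffer buf) throws IOException;
    }

    /** Illustrative in-memory source; a real stream would pull from a block reader instead. */
    static final class InMemorySource implements BufferReadable {
        private final ByteBuffer data;

        InMemorySource(byte[] bytes) {
            this.data = ByteBuffer.wrap(bytes);
        }

        @Override
        public int read(ByteBuffer buf) {
            if (!data.hasRemaining()) {
                return -1;
            }
            int n = Math.min(buf.remaining(), data.remaining());
            // Expose exactly n bytes of the source and copy them into the caller's buffer.
            ByteBuffer slice = data.duplicate();
            slice.limit(slice.position() + n);
            buf.put(slice);
            data.position(data.position() + n);
            return n;
        }
    }

    /** Fills the caller-supplied buffer until it is full or the source is exhausted. */
    static int readFully(BufferReadable in, ByteBuffer dst) throws IOException {
        int total = 0;
        while (dst.hasRemaining()) {
            int n = in.read(dst);
            if (n < 0) {
                break;
            }
            total += n;
        }
        return total;
    }

    public static void main(String[] args) throws IOException {
        byte[] payload = "hello, hdfs".getBytes(StandardCharsets.US_ASCII);
        BufferReadable in = new InMemorySource(payload);

        // The client chooses the buffer; a direct buffer lives outside the Java heap.
        ByteBuffer dst = ByteBuffer.allocateDirect(5);
        int n = readFully(in, dst);

        dst.flip();
        byte[] out = new byte[dst.remaining()];
        dst.get(out);
        System.out.println(n + " bytes: " + new String(out, StandardCharsets.US_ASCII));
    }
}
```

In this sketch the only copy is from the source into the caller's buffer; a byte[]-based read would need a second copy to move the data into native memory.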

Attachments

File                             Date             Size   Author
HDFS-2834.patch                  02/Mar/12 00:23  66 kB  Henry Robinson
HDFS-2834.patch                  02/Mar/12 00:35  64 kB  Henry Robinson
HDFS-2834-no-common.patch        02/Mar/12 22:22  60 kB  Henry Robinson
HDFS-2834.3.patch                03/Mar/12 01:26  63 kB  Henry Robinson
HDFS-2834.4.patch                03/Mar/12 09:09  58 kB  Henry Robinson
HDFS-2834.5.patch                03/Mar/12 20:57  59 kB  Henry Robinson
HDFS-2834.6.patch                03/Mar/12 22:45  59 kB  Henry Robinson
hdfs-2834-libhdfs-benchmark.png  05/Mar/12 08:26  43 kB  Henry Robinson
HDFS-2834.7.patch                06/Mar/12 08:54  61 kB  Henry Robinson
HDFS-2834.8.patch                06/Mar/12 23:05  61 kB  Henry Robinson
HDFS-2834.9.patch                09/Mar/12 00:39  49 kB  Henry Robinson
HDFS-2834.10.patch               20/Mar/12 16:29  48 kB  Henry Robinson
HDFS-2834.11.patch               21/Mar/12 06:36  60 kB  Henry Robinson

Issue Links

breaks

HDFS-3243 TestParallelRead timing out on jenkins (Closed)

depends upon

HADOOP-8135 Add ByteBufferReadable interface to FSDataInputStream (Closed)

is depended upon by

HDFS-3110 libhdfs implementation of direct read API (Closed)

is related to

HBASE-8143 HBase on Hadoop 2 with local short circuit reads (ssr) causes OOM (Closed)
HDFS-3053 Support for true zero-copy (mmap-based) reads (Resolved)
HDFS-3051 A zero-copy ScatterGatherRead api from FSDataInputStream (Open)
HBASE-21879 Read HFile's block to ByteBuffer directly instead of to byte for reducing young gc purpose (Closed)

relates to

HADOOP-8148 Zero-copy ByteBuffer-based compressor / decompressor API (Open)
HDFS-15693 Improve native code's performance when writing to HDFS (Open)
HDFS-3246 pRead equivalent for direct read path (Resolved)

People

Assignee: Henry Robinson

Reporter: Henry Robinson

Votes: 1

Watchers: 31

Dates

Created: 24/Jan/12 19:36

Updated: 24/Nov/20 14:29

Resolved: 21/Mar/12 17:31