[HADOOP-2758] Reduce memory copies when data is read from DFS - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.17.0
Component/s: None
Labels:
None

Release Note:
DataNode takes 50% less CPU while serving data to clients.

Description

Currently datanode and client part of DFS perform multiple copies of data on the 'read path' (i.e. path from storage on datanode to user buffer on the client). This jira reduces these copies by enhancing data read protocol and implementation of read on both datanode and the client. I will describe the changes in next comment.

Requirement is that this fix should reduce CPU used and should not cause regression in any benchmarks. It might not improve the benchmarks since most benchmarks are not cpu bound.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HADOOP-2758.patch
29/Feb/08 02:38
33 kB
Raghu Angadi
HADOOP-2758.patch
26/Feb/08 22:22
33 kB
Raghu Angadi
HADOOP-2758.patch
26/Feb/08 03:56
33 kB
Raghu Angadi
HADOOP-2758.patch
22/Feb/08 21:30
33 kB
Raghu Angadi
HADOOP-2758.patch
19/Feb/08 22:20
28 kB
Raghu Angadi
HADOOP-2758.patch
13/Feb/08 22:36
21 kB
Raghu Angadi

Issue Links

is depended upon by

HADOOP-1702 Reduce buffer copies when data is written to DFS

Closed

relates to

HADOOP-2154 Non-interleaved checksums would optimize block transfers.

Resolved

HDFS-354 Data node process consumes 180% cpu

Resolved

Activity

People

Assignee:: Raghu Angadi

Reporter:: Raghu Angadi

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 31/Jan/08 21:24

Updated:: 08/Jul/09 16:42

Resolved:: 04/Mar/08 19:11