Hadoop HDFS
  HDFS-822

Appends to already-finalized blocks can rename across volumes

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.21.0, 0.22.0
    • Fix Version/s: 0.21.0
    • Component/s: datanode
    • Labels: None
    • Hadoop Flags: Reviewed

      Description

      This is a performance thing. As I understand the code in FSDataset.append, if the block is already finalized, it needs to move it into the RBW directory so it can go back into a "being written" state. This is done using volumes.getNextVolume without preference to the volume that the block currently exists on. It seems to me that this could cause a lot of slow cross-volume copies on applications that periodically append/close/append/close a file. Instead, getNextVolume could provide an alternate form that gives preference to a particular volume, so the rename stays on the same disk.
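
      For illustration, a minimal sketch of the alternate getNextVolume form suggested above. The VolumeChooser and FsVolume classes below are simplified stand-ins for the FSDataset internals, not the actual Hadoop code; the only point is the "prefer the replica's current volume" idea.

      import java.io.File;
      import java.io.IOException;
      import java.util.List;

      public class VolumeChooser {

        /** Minimal stand-in for a DataNode storage volume. */
        public static final class FsVolume {
          private final File root;
          public FsVolume(File root) { this.root = root; }
          public long getAvailable() { return root.getUsableSpace(); }
        }

        private final List<FsVolume> volumes;
        private int nextIndex = 0;

        public VolumeChooser(List<FsVolume> volumes) { this.volumes = volumes; }

        /** Current behavior: plain round-robin over volumes with enough space. */
        public synchronized FsVolume getNextVolume(long blockSize) throws IOException {
          for (int i = 0; i < volumes.size(); i++) {
            FsVolume v = volumes.get(nextIndex);
            nextIndex = (nextIndex + 1) % volumes.size();
            if (v.getAvailable() >= blockSize) {
              return v;
            }
          }
          throw new IOException("No volume has " + blockSize + " bytes available");
        }

        /**
         * Proposed alternate form: prefer the volume the finalized replica already
         * lives on, so the move into the rbw directory stays a same-disk rename
         * rather than a cross-volume copy.
         */
        public synchronized FsVolume getNextVolume(long blockSize, FsVolume preferred)
            throws IOException {
          if (preferred != null && preferred.getAvailable() >= blockSize) {
            return preferred;
          }
          return getNextVolume(blockSize);  // fall back to round-robin
        }
      }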

      Attachments

      1. HDFS-822.patch (3 kB) - Hairong Kuang
      2. HDFS-822.patch (0.7 kB) - Hairong Kuang
      3. HDFS-822.patch (0.8 kB) - Hairong Kuang

        Activity

        Hudson added a comment -

        Integrated in Hdfs-Patch-h5.grid.sp2.yahoo.net #205 (See http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/205/)

        Hudson added a comment -

        Integrated in Hdfs-Patch-h2.grid.sp2.yahoo.net #101 (See http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h2.grid.sp2.yahoo.net/101/)
        HDFS-822. Appends to already-finalized blocks can rename across volumes. Contributed by Hairong Kuang.

        Hudson added a comment -

        Integrated in Hadoop-Hdfs-trunk #206 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Hdfs-trunk/206/)
        HDFS-822. Appends to already-finalized blocks can rename across volumes. Contributed by Hairong Kuang.

        Hudson added a comment -

        Integrated in Hadoop-Hdfs-trunk-Commit #173 (See http://hudson.zones.apache.org/hudson/job/Hadoop-Hdfs-trunk-Commit/173/)
        HDFS-822. Appends to already-finalized blocks can rename across volumes. Contributed by Hairong Kuang.

        Hairong Kuang added a comment -

        I've just committed this.

        Hadoop QA added a comment -

        +1 overall. Here are the results of testing the latest attachment
        http://issues.apache.org/jira/secure/attachment/12430802/HDFS-822.patch
        against trunk revision 899747.

        +1 @author. The patch does not contain any @author tags.

        +1 tests included. The patch appears to include 3 new or modified tests.

        +1 javadoc. The javadoc tool did not generate any warning messages.

        +1 javac. The applied patch does not increase the total number of javac compiler warnings.

        +1 findbugs. The patch does not introduce any new Findbugs warnings.

        +1 release audit. The applied patch does not increase the total number of release audit warnings.

        +1 core tests. The patch passed core unit tests.

        +1 contrib tests. The patch passed contrib unit tests.

        Test results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/196/testReport/
        Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/196/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
        Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/196/artifact/trunk/build/test/checkstyle-errors.html
        Console output: http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/196/console

        This message is automatically generated.

        Todd Lipcon added a comment -

        Looks good to me. +1

        Hairong Kuang added a comment -

        Todd's comment makes sense. This patch incorporates his comment and adds a unit test.

        Todd Lipcon added a comment -

        > So basically this patch delays the decision to check disk space as opposed to Todd's proposal.

        Only problem with this is that it now ignores the dfs.du.reserved setting, since that's only set at allocation-time. Maybe that's OK, since there are plenty of other ways to overrun the reserved space - just wanted to bring it up.

        Hairong Kuang added a comment -

        This new patch simply does what I proposed and it sets up a pipeline without checking the available space.

        If the disk does not have space when actual writes happen, the write will fail and the out-of-space datanode will be taken out of the pipeline. So basically this patch delays the decision to check disk space as opposed to Todd's proposal.
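
        As a generic illustration of the delayed check (not the DataNode pipeline code itself): rather than verifying free space before setting up the write, just attempt the write and treat a disk-full IOException as the failure signal.

        import java.io.IOException;
        import java.io.OutputStream;
        import java.nio.file.Files;
        import java.nio.file.Path;

        public final class DelayedSpaceCheck {

          /** Write without a preflight free-space check; a full disk surfaces as an IOException. */
          public static void write(Path target, byte[] data) throws IOException {
            try (OutputStream out = Files.newOutputStream(target)) {
              out.write(data);  // fails here (e.g. "No space left on device") if the volume is full
            }
            // In the HDFS pipeline, such a failure causes the out-of-space datanode
            // to be taken out of the pipeline rather than aborting the append.
          }
        }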

        dhruba borthakur added a comment -

        > But anyway my proposal is to disallow the across volume rename for now. Does this sound OK to everybody?

        sounds ok to me.

        Todd Lipcon added a comment -

        In that case, shouldn't the current patch throw an exception when the current volume's full, rather than calling getNextVolume?

        Hairong Kuang added a comment -

        > Current code handles this. It removes the partial file in the destination if rename fails.
        I checked the code again. Unfortunately the code does not handle this.

        But anyway my proposal is to disallow the across volume rename for now. Does this sound OK to everybody?

        Hairong Kuang added a comment -

        > if the datanode dies in the middle of a rename, we can have a partial file in the destination as well as the complete file in the source, isn't it?
        Current code handles this. It removes the partial file in the destination if rename fails.

        Sorry that I did not make my proposal clear. What I am thinking of doing in this jira is to disallow cross-volume renames. So appending to a block simply moves the block to the rbw directory in the same volume. This is definitely an improvement to 0.21 & trunk, because 0.21 & trunk may move the block to a different volume even if the current volume has space. This is a critical bug that should be fixed. As to whether we should rename across volumes when the current volume is full, let's do it in a different jira.
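
        A minimal sketch of the same-volume move described above, assuming a simplified finalized/rbw directory layout under the volume root; the real DataNode path and naming logic is more involved.

        import java.io.File;
        import java.io.IOException;

        public final class SameVolumeAppend {

          /**
           * Move a finalized block file and its meta file into the rbw directory of
           * the volume they already live on. Because source and destination are on
           * the same file system, each move is a metadata-only rename, not a data copy.
           */
          public static void moveToRbw(File volumeRoot, String blockFile, String metaFile)
              throws IOException {
            File finalizedDir = new File(volumeRoot, "current/finalized");
            File rbwDir = new File(volumeRoot, "current/rbw");
            if (!rbwDir.isDirectory() && !rbwDir.mkdirs()) {
              throw new IOException("Cannot create " + rbwDir);
            }
            rename(new File(finalizedDir, blockFile), new File(rbwDir, blockFile));
            rename(new File(finalizedDir, metaFile), new File(rbwDir, metaFile));
          }

          private static void rename(File src, File dst) throws IOException {
            if (!src.renameTo(dst)) {
              throw new IOException("Failed to rename " + src + " to " + dst);
            }
          }
        }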

        Todd Lipcon added a comment -

        > Moreover, rename across two mount points is not atomic, ie. if the datanode dies in the middle of a rename, we can have a partial file in the destination as well as the complete file in the source, isn't it?

        Yes, that's true. But wouldn't that just present as a corrupt replica when the block scanner catches up to it, or a client tries to read it?

        dhruba borthakur added a comment -

        If there is space in the same volume, then the rename occurs within the same volume. In this case, holding the FSDataset lock across the rename is ok.

        If there is no space, then the rename is actually a datacopy. In this case, we should not hold the FSDataset lock across the rename. Moreover, rename across two mount points is not atomic, ie. if the datanode dies in the middle of a rename, we can have a partial file in the destination as well as the complete file in the source, isn't it?
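
        A minimal sketch of the atomicity point, using plain java.nio.file calls rather than DataNode code: an atomic move only works within a single file store (volume); across volumes the "rename" degrades to copy plus delete, which is not atomic, and a failure mid-copy can leave a partial destination file that has to be cleaned up.

        import java.io.IOException;
        import java.nio.file.AtomicMoveNotSupportedException;
        import java.nio.file.Files;
        import java.nio.file.Path;
        import java.nio.file.StandardCopyOption;

        public final class CrossVolumeMove {

          public static void move(Path src, Path dst) throws IOException {
            try {
              // Same volume: a metadata-only rename that either fully happens or not at all.
              Files.move(src, dst, StandardCopyOption.ATOMIC_MOVE);
            } catch (AtomicMoveNotSupportedException e) {
              // Different volumes: fall back to copy + delete, which is not atomic.
              try {
                Files.copy(src, dst, StandardCopyOption.REPLACE_EXISTING);
              } catch (IOException copyFailure) {
                Files.deleteIfExists(dst);  // remove a possibly partial destination file
                throw copyFailure;
              }
              Files.delete(src);
            }
          }
        }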

        Todd Lipcon added a comment -

        Sounds reasonable, so long as we're confident that holding the FSDataset lock for 10-20 seconds isn't a bug (e.g. appending to a 250MB block will take 10 seconds at a 25MB/sec cross-volume copy rate, and could very well be slower if either disk is stressed).

        Hairong Kuang added a comment -

        It seems to me that moving a block to be appended to a different volume when the current volume has space is a bug, which I'd like to fix in this jira. However, moving a block to be appended to a different volume when the current volume does not have enough space for the estimated new bytes is an optimization, which we should probably work on in a different jira.

        Todd Lipcon added a comment -

        Patch looks like an improvement over the current code, but is there still a concern about holding the FSDataset lock in the case that this volume is full?

        Given that we don't have an intra-datanode balancer, it seems reasonably common that some disks are entirely full while others have free space (e.g. on an old cluster where 500G disks are replaced piecemeal with TB-sized ones).

        Hairong Kuang added a comment -

        This patch makes sure that the replica to be appended is moved to the same volume if the volume has enough space available.

        Hairong Kuang added a comment -

        I am looking at this issue. If the datanode releases the lock while copying replicas, I am not sure how we can ensure data integrity. In append, there are two potential file copies: one is the copy-on-write (the unlink operation) and the other is the cross-volume copy Todd pointed out.

        Todd Lipcon added a comment -

        I think in that case it should do a cross-volume block rename. However, it should avoid holding the FSDataset lock while doing so.

        dhruba borthakur added a comment -

        This looks like a bug that might be good to fix in 0.21 itself.

        On a related note, suppose the last partial block of a file has three replicas on three different volumes on three different datanodes. And let's suppose that these volumes (disks) have no more free space on them whereas there is plenty of free space on other datanodes and disks. Now, if an application re-opens the file to append to it, the append will fail because there isn't any free space on those devices (even though the cluster has tons of free space). Is it possible to avoid this inconsistent behaviour?

        Hairong Kuang added a comment -

        This is a valid point. Let's get it fixed in 0.21.

        Todd Lipcon added a comment -

        I haven't tried to actually produce this bug yet, but it may be reasonably important, since this is also a synchronized method, and a full cross-volume block copy could take several seconds at least, during which the DN can't do a whole lot.


          People

          • Assignee: Hairong Kuang
          • Reporter: Todd Lipcon
          • Votes: 0
          • Watchers: 8
