Hadoop HDFS: HDFS-3672

Expose disk-location information for blocks to enable better scheduling

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.0-alpha
    • Fix Version/s: 2.0.2-alpha
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

      Description

      Currently, HDFS exposes on which datanodes a block resides, which allows clients to make scheduling decisions for locality and load balancing. Extending this to also expose on which disk on a datanode a block resides would enable even better scheduling, on a per-disk rather than coarse per-datanode basis.

      This API would likely look similar to FileSystem#getFileBlockLocations, but also involve a series of RPCs to the responsible datanodes to determine disk ids.
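
      For illustration only, here is a minimal sketch of the shape such an API might take (the interface and method names are hypothetical, not part of HDFS):

        import java.io.IOException;
        import org.apache.hadoop.fs.BlockLocation;
        import org.apache.hadoop.fs.Path;

        // Hypothetical sketch: same inputs as FileSystem#getFileBlockLocations(Path, long, long),
        // but the returned locations would additionally carry an identifier for the disk holding
        // each replica, filled in by querying the responsible datanodes.
        public interface DiskLocationAware {
          BlockLocation[] getDiskBlockLocations(Path p, long start, long len)
              throws IOException;
        }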

      1. design-doc-v1.pdf
        73 kB
        Andrew Wang
      2. design-doc-v2.pdf
        73 kB
        Andrew Wang
      3. hdfs-3672-1.patch
        37 kB
        Andrew Wang
      4. hdfs-3672-10.patch
        68 kB
        Andrew Wang
      5. hdfs-3672-11.patch
        68 kB
        Andrew Wang
      6. hdfs-3672-12.patch
        69 kB
        Andrew Wang
      7. hdfs-3672-2.patch
        48 kB
        Andrew Wang
      8. hdfs-3672-3.patch
        49 kB
        Andrew Wang
      9. hdfs-3672-4.patch
        52 kB
        Andrew Wang
      10. hdfs-3672-5.patch
        52 kB
        Andrew Wang
      11. hdfs-3672-6.patch
        60 kB
        Andrew Wang
      12. hdfs-3672-7.patch
        61 kB
        Andrew Wang
      13. hdfs-3672-8.patch
        61 kB
        Andrew Wang
      14. hdfs-3672-9.patch
        68 kB
        Andrew Wang

        Issue Links

          Activity

          Andy Isaacson added a comment -

          also involve a series of RPCs to the responsible datanodes to determine disk ids.

          Keep in mind that this should be one RPC per DN rather than one RPC per block. If you have a dozen blocks on a single DN, it's a big and easy performance win to pass a vector of blocks and return a vector of locations, rather than asking for each block location in series.
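
          As a rough illustration of the batched shape described above (a hypothetical sketch, not the actual datanode protocol; the interface name and the use of numeric block IDs are simplifications):

            import java.io.IOException;
            import java.util.List;

            // Hypothetical sketch: one RPC per datanode carrying all blocks of interest,
            // returning one opaque disk ID per block (in the same order), rather than
            // one RPC per block.
            interface PerDatanodeDiskQuery {
              List<byte[]> getDiskIdsForBlocks(List<Long> blockIds) throws IOException;
            }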

          Andrew Wang added a comment -

          First hack at this. I still want to add some more tests, but I think the design is about right.

          This essentially provides the same API as DFS#getFileBlockLocations, except it returns a subclass of BlockLocation, HdfsBlockLocation, which carries an additional byte array: an opaque identifier that specifies on which disk on a datanode the block resides.

          Currently, this ID is mapped to the index of the HDFS data directory containing the block file (e.g. /data/1, /data/2). This can thus change across reboots/config changes, and clients need to be prepared to requery anyway since blocks do move around as part of normal operation.

          I'd like to perhaps split the new DFS#getFileHdfsBlockLocations function into a call to DFS#getFileBlockLocations to do the NN query to get block locations, and then pass these to some other call (DFS#getDiskIds?), since this would let you do multiple calls to DFS#getFileBlockLocations and then do one series of RPCs to the datanodes. But, I need to figure out how to change the BlockLocation[] back into a LocatedBlock[].

          It might also be nice to do the DN RPCs in parallel, since right now it's serial setup, query, teardown for each DN.

          Suresh Srinivas added a comment -

          Currently, HDFS exposes on which datanodes a block resides, which allows clients to make scheduling decisions for locality and load balancing. Extending this to also expose on which disk on a datanode a block resides would enable even better scheduling, on a per-disk rather than coarse per-datanode basis.

          I am not sure I understand your motivation. I can see the Namenode understanding disks/storages in the Datanode to improve scheduling, but I am not sure why clients should be exposed to this information. Can you describe the use cases more clearly? Also, please attach a short writeup/design that summarizes the motivation and captures this discussion.

          As regards NN knowing about this information, that is one of the motivations of HDFS-2832. If each storage volume that corresponds to a disk on Datanode has a separate storage ID, NN gets block reports and other stats per disk.

          Todd Lipcon added a comment -

          Hey Suresh. I'll try to answer a few of your questions above from the perspective of HBase and MR.

          The information is useful for clients when they have several tasks to complete which involve reading blocks on a given DataNode, but the order of the tasks doesn't matter. One example is in HBase: we currently have several compaction threads running inside the region server, and those compaction threads do a lot of IO. HBase could do a better job of scheduling the compactions if it knew which blocks were actually on the same underlying disk. If two blocks are on separate disks, you can get 2x the throughput by reading them at the same time, whereas if they're on the same disk, it would be better to schedule them one after the other.

          You can imagine this feature also being used at some point by MapReduce. Consider a map-only job which reads hundreds of blocks located on the same DN. When the associated NodeManager asks for a task to run, the application master can look at the already-running tasks on that node, understand which disks are currently not being read, and schedule a task which accesses an idle disk. Another MR use case is to keep track of which local disks the various tasks are reading from, and de-prioritize those disks when choosing a local disk to spill map output to, in order to avoid read-write contention.

          The other motivation is to eventually correlate these disk IDs with statistics/metrics within advanced clients. In HBase, for example, we currently always read from the local replica if it is available. If, however, one of the local disks is going bad, this can really impact latency, and we'd rather read a remote replica instead - the network latency is much less than the cost of accessing failing media. But we need to be able to look at a block and know which disk it's on in order to track these statistics.

          The overall guiding motivation is that we looked at heavily loaded clusters with 12 disks and found that we were suffering from pretty significant "hotspotting" of disk access. During any given second, about two thirds of the disks tend to be at 100% utilization while the others are basically idle. Using lsof to look at the number of open blocks on each data volume showed the same hotspotting: some disks had multiple tasks reading data whereas others had none. With a bit more client visibility into block<->disk correspondence, we can try to improve this.

          As regards NN knowing about this information, that is one of the motivations of HDFS-2832. If each storage volume that corresponds to a disk on Datanode has a separate storage ID, NN gets block reports and other stats per disk.

          I agree HDFS-2832 will really be useful for this. But it's a larger restructuring with much bigger implications. This JIRA is just about adding a new API which exposes some information that's already available. We explicitly chose to make the "disk ID" opaque in the proposed API – that way when HDFS-2832 arrives, we can really easily switch over the implementation to be based on the storage IDs without breaking users of the API.

          Todd Lipcon added a comment -

          A few comments on the initial patch:

          • I definitely think we need to separate the API for getting disk locations so that you can pass a list of LocatedBlocks. For some of the above-mentioned use cases (eg MR scheduler), you need to get the locations for many files, and you don't want to have to do a fan-out round for each of the files separately.
          • Per above, I agree that we should make the disk IDs opaque. But a single byte seems short-sighted. Let's expose them as an interface "DiskId" which can be entirely devoid of getters for now – its only contract would be that it properly implements comparison, equals, and hashcode, so users can use them to aggregate stats by disk, etc. Internally we can implement it with a wrapper around a byte[] (see the sketch after this list).
          • In the protobuf response, given the above, I think we should do something like:
            message Response {
              repeated bytes diskIds = 1;
              // For each block, an index into the diskIds array above, or MAX_INT
              // to indicate the block was not found on this DN.
              repeated uint32 diskIndexes = 2;
            }
            
          • Per above, need to figure out what you're doing for blocks that aren't found on a given DN. We also need to specify in the JavaDoc what happens in the response for DNs which don't respond. I think it's OK that the result would have some "unknown" - it's likely if any of the DNs are down.
          • Doing the fan-out RPC does seem important. Unfortunately it might be tricky, so I agree we should do it in a separate follow-up optimization.
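
          A minimal sketch of the opaque DiskId idea from the second bullet, assuming a byte[]-backed implementation (illustrative names, not the committed classes):

            import java.util.Arrays;

            // Hypothetical: an opaque disk identifier whose only contract is value-based
            // comparison, equality, and hashing, so callers can aggregate stats by disk
            // without seeing what the bytes mean.
            public interface DiskId extends Comparable<DiskId> { }

            class ByteArrayDiskId implements DiskId {
              private final byte[] id;

              ByteArrayDiskId(byte[] id) {
                this.id = id.clone();
              }

              @Override
              public int compareTo(DiskId other) {
                // Lexicographic comparison of the underlying bytes; assumes the other
                // DiskId is also byte[]-backed.
                byte[] o = ((ByteArrayDiskId) other).id;
                int len = Math.min(id.length, o.length);
                for (int i = 0; i < len; i++) {
                  int c = Byte.compare(id[i], o[i]);
                  if (c != 0) {
                    return c;
                  }
                }
                return Integer.compare(id.length, o.length);
              }

              @Override
              public boolean equals(Object other) {
                return other instanceof ByteArrayDiskId
                    && Arrays.equals(id, ((ByteArrayDiskId) other).id);
              }

              @Override
              public int hashCode() {
                return Arrays.hashCode(id);
              }
            }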
          Arun C Murthy added a comment -

          Todd - the possibilities are intriguing. However, it still seems we have a lot of work ahead of us to realize the potential.

          So, before we jump in and add new public APIs, can you/Andrew provide a study or, better, a prototype which proves that this API would actually be useful in some way downstream?

          Maybe we can look at some performance numbers if you have them? Or, maybe we can work on a branch where we have a clearly defined goal (say: improve HBase or improve MR scheduling) and we achieve that before we commit to this?

          Todd Lipcon added a comment -

          I understand the reticence to add new APIs without "proof" that they're useful. But it's a bit of a chicken-egg situation here. It's difficult for downstream projects to build against a branch or an uncommitted patch.

          One experiment I ran that I can report on is as follows (you may remember this from the HDFS Performance talk I gave prior to Hadoop Summit):

          • Test setup: 12x2T disks on a pseudo-distributed HDFS. Write 24 files, each ~10GB, to the local HDFS cluster.
          • Read throughput test (no scheduling): Start a "hadoop fs -cat /fileN > /dev/null" for all 24 files. Got ~700M/sec
          • Read throughput test (simulated "scheduling"): Run 12 threads, one per data directory: find /data/N -name blk* -exec cat {} \;. Got ~900M/sec (30% improvement)

          In each case, I ran "iostat -dxm 1" to collect disk stats on a 1-second interval. In the "unscheduled" test, each sample showed about 8 disks at 100% utilization and 4 disks at 0% utilization. In the "scheduled" test, all disks remain at 100% utilization.

          While the above experiment is obviously more tightly controlled than a real workload, it does show that you need to have scheduling to use all of the disks to their full potential.

          Would a fair compromise be to mark the new API as @InterfaceAudience.Unstable so that people understand it's experimental and may change or disappear in future releases? Given that the use cases for it are performance enhancement only, it seems like people could simply wrap in a try/catch so that, if the API ends up throwing an UnsupportedOperationException in a future version, it would just fall back to the slower un-scheduled path.
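
          For illustration, the advisory-only usage pattern could look roughly like this on the client side (a sketch; queryDiskLocations and the schedule* hooks are placeholder names, not the committed API):

            import java.io.IOException;
            import org.apache.hadoop.fs.BlockLocation;
            import org.apache.hadoop.fs.FileStatus;
            import org.apache.hadoop.fs.FileSystem;

            // Sketch only: queryDiskLocations stands in for the experimental HDFS call,
            // and the schedule* hooks stand in for application scheduling logic.
            abstract class DiskAwareReadPlanner {

              void planReads(FileSystem fs, FileStatus file) throws IOException {
                BlockLocation[] locs = fs.getFileBlockLocations(file, 0, file.getLen());
                try {
                  // Advisory only: if the unstable API changes or starts throwing
                  // UnsupportedOperationException, we lose the per-disk optimization,
                  // not correctness.
                  scheduleByDisk(queryDiskLocations(locs));
                } catch (UnsupportedOperationException e) {
                  scheduleUnordered(locs);  // fall back to the slower, unscheduled path
                }
              }

              // Would delegate to the experimental DistributedFileSystem API.
              abstract BlockLocation[] queryDiskLocations(BlockLocation[] locs)
                  throws IOException;

              abstract void scheduleByDisk(BlockLocation[] diskAwareLocs);

              abstract void scheduleUnordered(BlockLocation[] locs);
            }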

          Suresh Srinivas added a comment -

          Todd, thanks for describing the intent and use cases in detail. These APIs sort of make sense for experimentation.

          However, I want to highlight the following:
          There are multiple daemons reading from/writing to disk in Hadoop: Datanodes, MapReduce shuffle, and possibly HBase short-circuit reads. Given this, a view from the Datanode alone would not reflect the complete reality. Also, given that there are many applications on HDFS reading from/writing to disks as well, the view of a single application (in this case HBase or MapReduce) is also incomplete. While an application can make locally optimized scheduling decisions, it still may not result in better scheduling. The improvements one sees are going to be best-effort and unpredictable.

          Todd Lipcon added a comment -

          Hey Suresh. I agree with all your points above.

          One thing that's been talked about in the past is to consider using a local-only block pool for MR temp storage. That would at least get one of the other major disk users going through the same code paths.

          The other idea we're thinking about is to expose disk statistics such as current queue length and utilization for each local disk, as reported by the OS. We're still running some experiments locally, but our assumption is that, within short time-scales (~0.5 seconds), the lagging 0.5-second usage is a reasonably good predictor of the next 0.5 seconds, given that most Hadoop-style access is of 100MB+ chunks of data.

          So, are you OK with introducing these as Unstable-annotated APIs, perhaps with an extra JavaDoc warning that they are explicitly experimental and may cease to exist in the future?

          Arun C Murthy added a comment -

          It's difficult for downstream projects to build against a branch or an uncommitted patch.

          Umm? We could just write up the patches and test them either ad-hoc or via a dev-branch and then commit?

          Suresh Srinivas added a comment -

          are you OK with introducing these as Unstable-annotated APIs

          My concern is, if this is used in MapReduce it might be okay. But once it starts getting used in other downstream projects removing this would be a challenge.

          Todd Lipcon added a comment -

          My concern is, if this is used in MapReduce it might be okay. But once it starts getting used in other downstream projects removing this would be a challenge

          That's the whole point of the Unstable API annotation, isn't it? We can change the API and downstream projects should accept that.

          What if we also explicitly mark it as throws UnsupportedOperationException, so users of the API would be encouraged to catch this exception?

          Since it's a performance API, it's always going to be used in an "advisory" role anyway – any use of it could safely fall back to the non-optimized code path.

          I'd be OK compromising and calling it LimitedPrivate(MapReduce), but I know that at least one of our customers is interested in using an API like this as well. Unfortunately I can't give too many details on their use case due to NDA (lame, I know), but I just wanted to provide a data point that there is demand for this "in the wild".

          We're still running some experiments locally, but our assumption is that, within short time-scales (~0.5 seconds), the lagging 0.5 second usage is a reasonably good predictor of the next 0.5 seconds, given most Hadoop-style access is of 100MB+ chunks of data

          I ran a simple experiment yesterday on one of our test clusters. The cluster is doing a mix of workloads - I think at the time it was running a Hive benchmark suite on ~100 nodes. So, it was under load, but not 100% utilization.

          On all of the nodes, I collected /proc/diskstats once a second for an hour. I then removed all disk samples where there was 0 load on the disk, since that was just periods of inactivity between test runs. Then, I took the disk utilization at each sample, and appended it as a column to the data from the previous second. I loaded the data into R and constructed a few simple models for each second's disk utilization on a given disk based on the previous second's disk statistics.

          Linear model using only the current utilization to predict the next second's utilization:

          > m.linear.on.only.util <- lm(next_sec_util ~ ms_doing_io, data=d)
          

          (this would correspond to a trivial model like "assume that if a disk is busy now, it will still be busy in the next second")

          Linear model using all of the current statistics (queue length, read/write mix, etc) to predict next second's util:

          > m.linear <- lm(next_sec_util ~ ., data=d)
          

          Quadratic model using all of the current statistics, and their interaction terms, to predict next second's util:

          > d.sample.200k <- d[sample(nrow(d), size=200000),]
          > m.quadratic <- lm(next_sec_util ~ .:., data=d.sample.200k)
          

          Random forest (a decision-tree based model, trained using only 1% of the data, since it's slow):

          > d.sample.10k <- d[sample(nrow(d), size=10000),]
          > m.rf <- randomForest(next_sec_util~., data=d.sample.10k)
          

          The models fared as follows:

          Model                        Percent variance explained
          Linear on only utilization   58.4%
          Linear                       70.6%
          Quadratic                    73.9%
          Random forest                76.9%

          Certainly the above analysis is just one workload, and one in which the disks are not being particularly slammed. But, it does show that looking at a disk's current status is a reasonable predictor of status over the next second on a typical MR cluster.

          Tom White added a comment -

          A small comment on the patch - how about "DiskBlockLocation" instead of "HdfsBlockLocation" (and similarly for the method name) since other FileSystem implementations could implement this too.

          Todd Lipcon added a comment -

          A small comment on the patch - how about "DiskBlockLocation" instead of "HdfsBlockLocation" (and similarly for the method name) since other FileSystem implementations could implement this too.

          Given it's an experimental API which we might want to change a few times until we get it right, I'd rather keep the scope to DistributedFileSystem at this point, rather than trying to add a generic API. If people come up with a way to generalize it to other file systems, let's add that as a future enhancement. Does that sound reasonable?

          Andrew Wang added a comment -

          Newer version of the patch, addressing Todd and Tom's comments.

          One unfortunate bit is that, to split the NN and DN RPCs, I needed to add a subclass of BlockLocation that hides a corresponding LocatedBlock. An array of these HdfsBlockLocations is now returned by DFS#getFileBlockLocations and downcast in DFSClient#getDiskBlockLocations to retrieve the LocatedBlock.

          I took Todd's advice about Integer.MAX_VALUE for denoting invalid blocks, but turn it into a boolean accessible via DiskId#isValid before it's shown to consumers of FileSystem.

          Finally, I already renamed things to DiskBlockLocation based on Tom's comment, and reused the name HdfsBlockLocation for my LocatedBlock wrapper class. I can re-rename both of these if we don't like them.
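
          Structurally, the wrapper described above is something like the following sketch (the class name, fields, and constructor shape are illustrative, not the patch's exact code):

            import org.apache.hadoop.fs.BlockLocation;
            import org.apache.hadoop.hdfs.protocol.LocatedBlock;

            // Sketch: a BlockLocation subclass that remembers the LocatedBlock it was
            // derived from, so a later client-side call can recover it by downcasting
            // instead of asking the NameNode again.
            class LocatedBlockWrapper extends BlockLocation {
              private final LocatedBlock block;

              LocatedBlockWrapper(String[] names, String[] hosts, long offset,
                  long length, LocatedBlock block) {
                super(names, hosts, offset, length);
                this.block = block;
              }

              LocatedBlock getLocatedBlock() {
                return block;
              }
            }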

          Tom White added a comment -

          Todd, if the feature is scoped to DistributedFileSystem then an alternative would be not to expose it in FileSystem at all at this point, so users have to cast to DistributedFileSystem. That or rename it as I originally suggested. The point is to avoid introducing HDFS-isms in the FileSystem interface.

          Todd Lipcon added a comment -

          Yep, I agree – the user should have to downcast to DistributedFileSystem to get at this API. I didn't notice that the proposed patch changed FileSystem itself.

          Andrew Purtell added a comment -

          This can also be viewed as the query side of an API for device-aware block placement. With that in mind, consider renaming things like DiskBlockLocation to BlockDeviceLocation. One attribute of BlockDeviceLocation should be an enum Type. Follow-up work could then add an API for specifying block placement according to BlockDeviceLocation.Type.

          This would enable use cases like creating certain files on BlockDeviceLocation.Type.FLASH where they might be frequently accessed, especially for random reads; and others (and by default) on BlockDeviceLocation.Type.DISK for selecting spinning media.

          Please pardon the interruption.
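
          As a hypothetical sketch of the device-type attribute suggested above (illustrative only; nothing like this exists in the current patch):

            // Hypothetical: a device-type attribute that a query/placement API could
            // expose, and later accept as a placement hint.
            public enum BlockDeviceType {
              DISK,   // spinning media, the default
              FLASH   // SSD/flash, for frequently or randomly read files
            }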

          Andrew Wang added a comment -

          Nuked the method from FileSystem, and did some small cleanups. I also added InterfaceStability.Unstable annotations on the new classes in hadoop-common.

          Still open to renaming suggestions, if desired. I'd like to keep them named .*BlockLocation for consistency, because they are subclasses of BlockLocation.

          Perhaps HdfsBlockLocation for the DistributedFileSystem API, and LocatedBlockLocation for the internal LocatedBlock wrapper?

          Andrew Wang added a comment -

          I think at this point, the patch addresses everyone's comments thus far. Does someone mind marking it Patch Available so Jenkins runs?

          @Suresh @Arun: I'd be happy to put up a short design doc if you'd like. I think Todd's above posts make a pretty compelling case for potential I/O improvements, and the implementation of the new DistributedFileSystem api would definitely get refactored when HDFS-2832 is finished.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12538086/hdfs-3672-3.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          -1 findbugs. The patch appears to introduce 1 new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.http.TestHttpServer

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/2921//testReport/
          Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/2921//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-hdfs.html
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/2921//console

          This message is automatically generated.

          Arun C Murthy added a comment -

          I'll ask again since I didn't get a response - wouldn't it make sense to commit this patch to a dev-branch, use that to prototype changes to either MapReduce or HBase, and then merge it in?

          Todd Lipcon added a comment -

          I'll ask again since I didn't get a response - wouldn't it make sense to commit this patch to a dev-branch. Use that to prototype changes to either MapReduce or HBase and then merge it in?

          There are projects outside of just HBase and MapReduce that would like to run against this, some of which are not Apache projects. As I mentioned above, we have at least one customer who would like to use this feature in their code to get better disk efficiency. They need to run against an actual release, not a dev branch build. This is the primary use case we're targeting right now. I want to be perfectly honest: the HBase/MR examples I gave above are not on our immediate roadmap; they just serve as proof that this isn't a one-off/niche improvement.

          The other downside with a dev branch is that it's difficult for downstream OSS projects to integrate against something that's not in a release. HBase already has to build against several different Maven profiles to support 1.0, 0.23, and 2.0. Adding another profile against a dev branch not available in maven is not feasible.

          This isn't the first time an API has been added to the trunk code before downstream users exist. For example, FileContext was in Hadoop for somewhere around a year before MR2 started to migrate to it. The "New MR API" is still barely used based on my discussions with users. If there is sufficient motivation (plus customer demand) for an API, and the API is explicitly marked Unstable, what's the problem with including it? It's entirely new code and has no risk of destabilizing the existing feature set.

          I fear that blocking APIs like this from Apache will only serve to fracture the Hadoop user base, pushing us back towards the 0.20-era nightmare of distinct distros with distinct non-overlapping capabilities.

          Do you have a technical objection to the new code: for example, a reason why it will destabilize the existing feature set?

          Andrew Wang added a comment -

          Fix findbugs. I also parallelized the DN RPCs with Callables and a threadpool.
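
          The fan-out pattern described here looks roughly like the following sketch (not the patch's actual code; the byte[][] result type and the pool size are placeholders):

            import java.util.ArrayList;
            import java.util.List;
            import java.util.concurrent.Callable;
            import java.util.concurrent.ExecutorService;
            import java.util.concurrent.Executors;
            import java.util.concurrent.Future;

            // Sketch: submit one query per datanode to a pool so the DN RPCs run in
            // parallel instead of serially, then gather the results in order.
            class ParallelDnQueries {
              static List<byte[][]> queryAll(List<Callable<byte[][]>> perDnQueries)
                  throws Exception {
                ExecutorService pool = Executors.newFixedThreadPool(
                    Math.max(1, Math.min(10, perDnQueries.size())));
                try {
                  List<Future<byte[][]>> futures = new ArrayList<Future<byte[][]>>();
                  for (Callable<byte[][]> query : perDnQueries) {
                    futures.add(pool.submit(query));   // fan out
                  }
                  List<byte[][]> results = new ArrayList<byte[][]>();
                  for (Future<byte[][]> f : futures) {
                    results.add(f.get());              // gather, preserving order
                  }
                  return results;
                } finally {
                  pool.shutdown();
                }
              }
            }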

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12538617/hdfs-3672-4.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/2931//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/2931//console

          This message is automatically generated.

          Arun C Murthy added a comment -

          Todd - first of all, no one is blocking anything.

          Hey Suresh. I'll try to answer a few of your questions above from the perspective of HBase and MR.

          This jira was started with the premise that this new feature was useful to MapReduce and HBase (http://s.apache.org/NJY). So, I assumed there would be some work in that direction.

          If that is the case, I don't see how the suggestion to do the work in a dev-branch before merging to mainline is blocking anything. It is something we have done many times over for YARN, HDFS HA, etc.

          Personally, if anyone was doing this work on MR, I'd be very interested in collaborating, heck - learning.

          However, given my experience on MR, I'd classify it as high-risk but very, very interesting research, since on mid-sized clusters (a few hundred nodes) and beyond the scheduling overhead might more than negate the I/O gains. Hence, again, doing that in a dev-branch is absolutely the right thing to do from a project and risk-management perspective.

          This isn't the first time an API has been added to the trunk code before downstream users exist.

          Yes, this wouldn't be the first time we made that mistake.

          Clearly, we have been dealing with the consequences of our previous mistakes for a while now. Arguing that that is a good reason to do the same, again, is not cogent.

          As I mentioned above, we have at least one customer who would like to use this feature in their code to get better disk efficiency. They need to run against an actual release, not a dev branch build. This is the primary use case we're targeting right now. I want to be perfectly honest: the HBase/MR examples I gave above are not on our immediate roadmap; they just serve as proof that this isn't a one-off/niche improvement.

          Now, clearly, you don't plan to do any work on either HBase or MR anytime soon and you have a different roadmap for a client.

          If you had made that clear sooner, the conversation would be different.

          Essentially, for the foreseeable future this will be dead code which is not going to be beneficial to anyone in the community... yet, the burden of maintenance etc. will remain.

          No, that is not a big deal since this particular change has a fairly small cross-section - it might be harder to make the argument for a future, more extensive change of this kind. Clearly, if it's a plugin etc., its easier to digest.

          IAC, I don't wish to debate this further.


          Importantly, we should switch this feature off by default so that people who use this understand that this isn't necessarily supported - at least until we have a real use case for this in the community.

          Andrew Wang added a comment -

          Attaching a design doc detailing the use cases and trying to plot out the future direction. Happy to expand on anything unclear.

          Overall, I feel like there's strong interest in the API from multiple parties (the unnamed Cloudera customer, HBase, MR), and fairly clear potential performance improvements. I'd appreciate any advice on making it crystal clear to downstream users that this is an unstable API. We've already got the appropriate annotations, and I could also make it require a config option before doing anything useful (which I think satisfies "default off"). Any other suggestions?
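
          For the "default off" idea, a minimal sketch of config gating (the property name below is made up for illustration, not an actual Hadoop key):

            import org.apache.hadoop.conf.Configuration;

            // Hypothetical gating: refuse to issue the extra datanode RPCs unless the
            // cluster/client explicitly enabled the experimental feature.
            class ExperimentalApiGate {
              static void checkEnabled(Configuration conf) {
                if (!conf.getBoolean("dfs.client.disk-block-locations.enabled", false)) {
                  throw new UnsupportedOperationException(
                      "Disk block location queries are disabled by configuration");
                }
              }
            }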

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12538854/design-doc-v1.pdf
          against trunk revision .

          -1 patch. The patch command could not apply the patch.

          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/2937//console

          This message is automatically generated.

          Andrew Wang added a comment -

          Another patch rev, basically just doing stylistic cleanups. I haven't heard any code-related feedback in a while, so I haven't changed any classnames or added any conf options.

          I've tried to satisfy all the comments thus far, and I would really like to get this in soon if possible. Happy to listen to any further feedback about what I can do to make this happen.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12539102/hdfs-3672-5.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.blockmanagement.TestBlockTokenWithDFS
          org.apache.hadoop.hdfs.TestPersistBlocks

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/2948//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/2948//console

          This message is automatically generated.

          Aaron T. Myers added a comment -

          Patch looks pretty good to me. A few comments:

          1. In DFSClient#getDiskBlockLocations, I recommend you add an instanceof check before the BlockLocation downcast to HdfsBlockLocation. Much better to throw a helpful RTE than some opaque ClassCastException.
          2. The DFSClient#getDiskBlockLocations method is huge, and has a few very distinct phases. I recommend you break this up into a few separate helper methods, e.g. one or two to initialize the data structures, one or two to perform the RPCs, one to re-associate the DN results with the correct block, etc.
          3. Unless I'm missing something, seems like you could easily make DiskBlockLocationCallable a static inner class.
          4. The javadoc parameter comment "@param blocks a List<LocatedBlock>" is not very helpful, since when the javadocs are generated the type of the parameter will automatically be included.
          5. The javadoc for DFSClient#getDiskBlockLocations should be a proper javadoc, i.e. with @param and @returns tags. I also recommend having this javadoc reference DistributedFileSystem#getFileDiskBlockLocations.
          6. In the new javadoc in DistributedFileSystem, you incorrectly say that this interface exists in the FileSystem class as well, and say "this is more helpful with DFS", which is the only implementation.
          7. I think you should change the LimitedPrivate InterfaceAudience annotations to Public, but keep the Unstable InterfaceStability annotations.
          8. Put a single space around your operators, e.g. "for (int i=0; i<blocks.size(); i++)"
          9. Unless I'm missing something, I don't think I see the ability to disable this feature, let alone it being off by default, as Arun requested.
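
          A minimal sketch of the guard suggested in comment 1, assuming the entry point receives BlockLocation[] and the wrapper class exposes its LocatedBlock through a getLocatedBlock() accessor (the helper name and package layout here are illustrative, not copied from the patch):

            import java.util.ArrayList;
            import java.util.List;

            import org.apache.hadoop.fs.BlockLocation;
            import org.apache.hadoop.fs.HdfsBlockLocation;
            import org.apache.hadoop.hdfs.protocol.LocatedBlock;

            public class DowncastGuardExample {
              /** Fail fast with a descriptive message instead of a bare ClassCastException. */
              static List<LocatedBlock> toLocatedBlocks(BlockLocation[] locations) {
                List<LocatedBlock> blocks = new ArrayList<LocatedBlock>();
                for (BlockLocation loc : locations) {
                  if (!(loc instanceof HdfsBlockLocation)) {
                    throw new ClassCastException("DFSClient#getDiskBlockLocations "
                        + "expected to be given instances of HdfsBlockLocation");
                  }
                  blocks.add(((HdfsBlockLocation) loc).getLocatedBlock());
                }
                return blocks;
              }
            }
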
          Andrew Wang added a comment -

          Thanks for the detailed review ATM, I tried to address all your comments.

          I broke out the huge DFSClient method into a few smaller ones, which are still a bit large but logically sound. I can try to go further with this, but it'll mean passing more stuff in parameters.

          The config option I added ("dfs.client.file-block-locations.enabled") is default off, and checked client-side only. I could add this to the DN side too if we want to be really sure.
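
          For anyone trying this out, the opt-in would look roughly like the following; only the config key is taken from the comment above, the rest is an illustrative sketch:

            import org.apache.hadoop.conf.Configuration;
            import org.apache.hadoop.fs.FileSystem;

            public class OptInExample {
              public static void main(String[] args) throws Exception {
                Configuration conf = new Configuration();
                // Off by default; the client has to opt in explicitly before the
                // experimental disk-location calls do anything useful.
                conf.setBoolean("dfs.client.file-block-locations.enabled", true);
                FileSystem fs = FileSystem.get(conf);
                // ... cast to DistributedFileSystem and call the new API here ...
                fs.close();
              }
            }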

          Aaron T. Myers added a comment -

          Breaking up DFSClient#getDiskBlockLocations makes the code a lot more readable IMO. Thanks for doing that.

          A few more comments:

          1. This exception message shouldn't include "getDiskBlockLocations". I recommend you just say "DFSClient#getDiskBlockLocations expected to be given instances of HdfsBlockLocation"
          2. In the "re-group the locatedblocks to be grouped by datanodes..." loop, it seems like instead of the if (...) check, you could just put the initialization of the LocatedBlock list inside the outer loop, before the inner loop.
          3. Rather than using a hard-coded 10 threads for the ThreadPoolExecutor, please make this configurable. I think it's reasonable to not document it in a *-default.xml file, since most users will never want to change this value, but if someone does find the need to do it it'd be nice to not have to recompile.
          4. Rather than reusing the socket read timeout as the timeout for the RPCs to the DNs, I think this should be separately configurable. That conf value is used as the timeout for reading block data from a DN, and defaults to 60s. I think it's entirely reasonable that callers of this API will want a much lower timeout. For that matter, you might consider calling the version of ScheduledThreadPoolExecutor#invokeAll that takes a timeout as a parameter.
          5. You should add a comment explaining the reasoning for having this loop. (I see why it is, but it's not obvious, so should be explained.)
            +    for (int i = 0; i < futures.size(); i++) {
            +      metadatas.add(null);
            +    }
            
          6. In the final loop in DFSClient#queryDatanodesForHdfsBlocksMetadata, I recommend you move the fetching of the callable and the datanode objects to the catch clause, since that's the only place those variables are used.
          7. In the same catch clause mentioned above, I recommend you log the full exception stack trace if LOG.isDebugEnabled().
          8. "did not" should be two words:
            +            LOG.debug("Datanode responded with a block disk id we did" +
            +                "not request, omitting.");
            
          9. I think we should make it clear in the HdfsDiskId javadoc that it only uniquely identifies a data directory on a DN when paired with that DN. i.e. it is not the case that DiskId is unique between DNs.
          10. You shouldn't be using protobuf ByteString outside of the protobuf translator code - just use a byte[]. For that matter, it's only necessary that the final result to clients of the API be an opaque identifier. In the DN-side implementation of the RPC, and even the DFSClient code, you could reasonably use a meaningful value that's not opaque.
          11. How could this possibly happen?
            +        // Oddly, we got a blockpath that didn't match any dataDir.
            +        if (diskIndex == dataDirs.size()) {
            +          LOG.warn("Could not determine the data dir of block " 
            +              + block.toString() + " with path " + blockPath);
            +        }
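
          On points 3 and 4, a sketch of what a bounded pool plus an overall RPC timeout could look like using ExecutorService#invokeAll; the pool size and timeout are placeholders, and the callable type stands in for the per-datanode query:

            import java.util.List;
            import java.util.concurrent.Callable;
            import java.util.concurrent.ExecutorService;
            import java.util.concurrent.Executors;
            import java.util.concurrent.Future;
            import java.util.concurrent.TimeUnit;

            public class FanOutExample {
              static <T> List<Future<T>> queryDatanodes(List<Callable<T>> callables,
                  int poolSize, long timeoutMs) throws InterruptedException {
                ExecutorService pool = Executors.newFixedThreadPool(poolSize);
                try {
                  // Callables that do not complete in time come back cancelled, so the
                  // caller can skip those datanodes instead of blocking on them.
                  return pool.invokeAll(callables, timeoutMs, TimeUnit.MILLISECONDS);
                } finally {
                  pool.shutdown();
                }
              }
            }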
            
          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12539391/hdfs-3672-6.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.TestFileConcurrentReader

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/2961//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/2961//console

          This message is automatically generated.

          Andrew Wang added a comment -

          Thanks for the (very thorough) reviews. Addressed as recommended, except as follows:

          In the "re-group the locatedblocks to be grouped by datanodes..." loop, it seems like instead of the if (...) check, you could just put the initialization of the LocatedBlock list inside the outer loop, before the inner loop.

          I think it's right as is. Potentially, you need to add a new list for every datanode replica of every LocatedBlock, which is why the initialization happens inside the doubly nested loop.
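
          For concreteness, a sketch of that regrouping, assuming LocatedBlock#getLocations() is the replica list (the helper itself is illustrative, not code from the patch):

            import java.util.ArrayList;
            import java.util.HashMap;
            import java.util.List;
            import java.util.Map;

            import org.apache.hadoop.hdfs.protocol.DatanodeInfo;
            import org.apache.hadoop.hdfs.protocol.LocatedBlock;

            public class RegroupExample {
              /** Group blocks by each datanode that holds a replica of them. */
              static Map<DatanodeInfo, List<LocatedBlock>> groupByDatanode(
                  List<LocatedBlock> blocks) {
                Map<DatanodeInfo, List<LocatedBlock>> perDatanode =
                    new HashMap<DatanodeInfo, List<LocatedBlock>>();
                for (LocatedBlock block : blocks) {
                  for (DatanodeInfo dn : block.getLocations()) {
                    // A new list may be needed for any replica of any block, which
                    // is why the initialization sits inside the inner loop.
                    List<LocatedBlock> list = perDatanode.get(dn);
                    if (list == null) {
                      list = new ArrayList<LocatedBlock>();
                      perDatanode.put(dn, list);
                    }
                    list.add(block);
                  }
                }
                return perDatanode;
              }
            }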

          Rather than using a hard-coded 10 threads for the ThreadPoolExecutor, please make this configurable. I think it's reasonable to not document it in a *-default.xml file, since most users will never want to change this value, but if someone does find the need to do it it'd be nice to not have to recompile.

          Since I already had hdfs-default.xml open to add the timeout config option, I documented this one too.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12539673/hdfs-3672-7.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.datanode.TestDataNodeVolumeFailureReporting

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/2964//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/2964//console

          This message is automatically generated.

          Aaron T. Myers added a comment -

          Thanks for addressing all my feedback, Andrew. The latest patch looks good to me, modulo one nit: looks like there's an unintended import of com.google.protobuf.ByteString in DFSClient.

          +1 once this is addressed.

          Andrew Wang added a comment -

          Nit addressed.

          Suresh Srinivas added a comment -

          I need a day or two to review the doc and patch.

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12539712/hdfs-3672-8.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/2966//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/2966//console

          This message is automatically generated.

          Aaron T. Myers added a comment -

          Hi Suresh, sounds good, and thanks for volunteering to help with a review. If you don't find time for a review in the next day or two I'd like to go ahead and commit this patch as-is. If that happens, we could of course always address any feedback you have in follow-up JIRAs.

          Suresh Srinivas added a comment -

          The more I think about this, the less convinced I am about this API. Hotspots are better avoided by distributing the data, or maybe even by increasing the replica count (a variant of the balancer), instead of building complicated scheduling logic into the applications. I have already commented on how limited the application's view is. So really, for MapReduce I am not convinced this is how one should go about solving the hotspot issues. I am not sure the HBase cases would use this either (is there at least a jira in HBase on this?).

          That leaves me to conclude the only real use case is that of "unknown customer". For that we are adding this non-trivial code in the core parts of HDFS! I am not convinced. However I will not block it and encourage you to think about doing it in a less hacky way.

          Is there a timeline where someone will work on HBase or MapReduce enhancements to use this capability?

          Document comments:

          1. "achieve disk-locality", do you mean "achieve datanode-locality"?
          2. Introduction: Not sure why the document uses "disk topology". It is just disk location right?
          3. Can we remove mentioning "Cloudera's unknown customer", it serves no purpose.

          Code review comments:

          1. DiskBlockLocation#diskIds, HdfsBlockLocation#block, HdfsDiskId#id and HdfsDiskId#isValid - make them final. The same goes for the members of HdfsBlocksMetadata.
          2. Why is this API marked @InterfaceAudience.Public. I think we should remove it and just leave InterfaceStability.Unstable
          3. minor "that can identifies"
          4. I think DiskBlockLocation, DiskId are not generic names. What if the underlying is not local disks at all, but mounted directory from another storage?
          5. Configuration to turn off this functionality should be on the server side also. Otherwise a client can just enable this functionality without the admin having control over it.
          6. Is this functionality expected to be used for a single file or for blocks belonging to many files? The current approach of using the diskID as an index into a data structure that could change (with disk failures etc.) may not provide stable diskIds.
          7. "Get a HdfsBlocksMetadata" - make HdfsBlockLocation a link
          8. getHdfsBlocksMetadata may have a bug. Let's say the block path is /foo/b/dir1 and there is another valid storage directory /foo/b/dir; is the correct index returned? Instead of getBlockFile, you may be better off using getReplicaInfo and using the volume from it to get the directory.
          9. I am surprised the API returns a list of diskIds and the diskIds set for each block. What is the use of the first diskId list?
          10. dfs.datanode.handler.count is 3. This could push the handler count limit.
          11. I like that DiskId is opaque. It would be good not to expose any members such as getId, and to keep it opaque even in HdfsDiskId.
          12. Please leave DFSClient#getBlockLocation() unchanged.
          13. I prefer moving the experimental code out of DFSClient to a separate helper class. Please consider breaking some of the long methods into smaller methods.
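
          Item 8 is easiest to see with a concrete example. What follows is a hypothetical illustration (not code from the patch) of why naive prefix matching on the block path resolves the wrong data directory when one configured directory is a string prefix of another:

            public class PrefixMatchBugExample {
              public static void main(String[] args) {
                // Hypothetical paths: "/foo/b/dir" is a string prefix of
                // "/foo/b/dir1", so prefix matching picks the wrong index.
                String[] dataDirs = { "/foo/b/dir", "/foo/b/dir1" };
                String blockPath = "/foo/b/dir1/current/blk_123";
                int diskIndex = -1;
                for (int i = 0; i < dataDirs.length; i++) {
                  if (blockPath.startsWith(dataDirs[i])) {
                    diskIndex = i; // matches index 0 first, the wrong directory
                    break;
                  }
                }
                System.out.println("Resolved index: " + diskIndex); // prints 0, not 1
                // Comparing volume objects instead of path strings avoids the ambiguity.
              }
            }
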
          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12539964/design-doc-v2.pdf
          against trunk revision .

          -1 patch. The patch command could not apply the patch.

          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/2977//console

          This message is automatically generated.

          Andrew Wang added a comment -

          Thanks for the design doc + code review Suresh. Doc comments I fixed. Code comments, I'd like to check a few things with you before cutting another patch. Everything else I'll fix as recommended.

          I think DiskBlockLocation, DiskId are not generic names. What if the underlying is not local disks at all, but mounted directory from another storage?

          Perhaps Storage(BlockLocation|Id)? Volume(BlockLocation|Id)? I'm not entirely sure of the end-user terminology here.

          Instead of getBLockFile, you may be better of using getReplicaInfo and use the volume from it to get the directory.

          This was a really good point; it was bugged. I changed it to do comparisons on volumes, which should work as expected (no path comparisons).

          Is this functionality expected to be used for a single file or for blocks belonging to many files? Because the current way of diskID as index in a datastructure that could change (with disk failures etc.) may not provide stable diskIds.

          I am surprised the API that returns list of diskIds and diskIds set for each block. What is the use of first diskId list.

          This could have been more clearly documented in the code, I'll beef it up. I did this based on Todd's earlier comment; basically we pass a list of the DiskIds on the datanode (one per volume), and then a list of indexes into this list (one per block). Since the DiskId of a volume should be the same for the life of a datanode, I think this is fairly stable.
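
          A sketch of that response shape and how a client resolves it; the class and method names here are illustrative, not the actual protocol classes:

            import java.util.List;

            /** Illustrative shape of the per-datanode response described above. */
            public class ExampleBlocksMetadata {
              // One entry per volume on the datanode; each entry is an opaque id.
              private final List<byte[]> volumeIds;
              // One entry per requested block: an index into volumeIds, or -1 if unknown.
              private final List<Integer> volumeIndexes;

              public ExampleBlocksMetadata(List<byte[]> volumeIds,
                  List<Integer> volumeIndexes) {
                this.volumeIds = volumeIds;
                this.volumeIndexes = volumeIndexes;
              }

              /** Resolve the opaque volume id for the i-th requested block. */
              public byte[] volumeIdForBlock(int blockIndex) {
                int idx = volumeIndexes.get(blockIndex);
                return (idx < 0) ? null : volumeIds.get(idx);
              }
            }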

          Clients are going to have to deal with staleness in this disk location info, since as you noted, a block's volume can change based on configuration, failures, and normal HDFS operation. Clients that peek inside HDFS for BlockLocations via #getBlockLocations() need to be fairly sophisticated anyway, since they already deal with this kind of thing at the DN level.

          dfs.datanode.handler.count is 3. This could push the handler count limit.

          Should I just bump the default (say, to 10)? I haven't done any performance testing, so I don't know if it's a problem.

          Please leave DFSClient#getBlockLocation() unchanged.

          I'd love to do this, but there's currently a hacky thing going on here. The new #getDiskBlockLocations call needs ExtendedBlocks to query the datanodes. The current #getBlockLocations returns BlockLocations, which don't have this information. That's why I changed it to return HdfsBlockLocations instead, which also include the required ExtendedBlock along with the BlockLocation data.
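
          A minimal sketch of that wrapper idea, under the assumption that the BlockLocation accessors used below exist as shown (the class in the patch may differ in details):

            import java.io.IOException;

            import org.apache.hadoop.fs.BlockLocation;
            import org.apache.hadoop.hdfs.protocol.LocatedBlock;

            /**
             * Illustrative wrapper: a BlockLocation that also carries the LocatedBlock,
             * so the client can later query datanodes using its ExtendedBlock.
             */
            public class ExampleHdfsBlockLocation extends BlockLocation {
              private final LocatedBlock block;

              public ExampleHdfsBlockLocation(BlockLocation loc, LocatedBlock block)
                  throws IOException {
                super(loc.getNames(), loc.getHosts(), loc.getTopologyPaths(),
                    loc.getOffset(), loc.getLength());
                this.block = block;
              }

              public LocatedBlock getLocatedBlock() {
                return block;
              }
            }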

          Aaron T. Myers added a comment -

          Why is this API marked @InterfaceAudience.Public. I think we should remove it and just leave InterfaceStability.Unstable

          I was under the impression that all public classes needed to have an @InterfaceAudience annotation, and an @InterfaceStability annotation unless they're marked @InterfaceAudience.Private. Am I wrong about that?

          Configuration to turn off this functionlity should be on the server side also. Otherwise a client can just enable this functionlality without the admin having control over it.

          I thought about this a fair bit while reviewing the code. The conclusion that I came to is that the stated reason that Arun wanted this feature disabled by default was "so that people who use this understand that this isn't necessarily supported." A client-side-only config seems to serve that purpose. Making this config server side as well only serves to require the admin enable the config and restart their cluster before some client that wants to try to use this functionality can give it a shot. That seems to me to be a strictly unnecessary pain for both the admin and user that doesn't seem to further Arun's stated goal. For that matter, why would an admin want to prevent clients from calling this API? If you insist on having a server side config for this, I'd like to suggest having two separate configs: a server-side one that defaults to enabled, but so that an admin may consciously disable it, and a client-side config that defaults to disabled so that users of this API must consciously configure their client, to support Arun's stated goal of making sure people are aware that it's an experimental API.

          Arun C Murthy added a comment -

          I'd really encourage you to put this into the DataNode and throw an UnsupportedOperationException rather than merely do this via a client-side config.

          Aaron T. Myers added a comment -

          I'd really encourage you to put this into the DataNode and throw an UnsupportedOperationException rather than merely do this via a client-side config.

          That's fine by me. I don't feel super strongly about this, so if this is your preference Arun, let's go with that.

          Suresh Srinivas added a comment -

          Perhaps Storage(BlockLocation|Id)? Volume(BlockLocation|Id)? I'm not entirely sure of the end-user terminology here.

          DiskBlockLocation could be BlockStorageLocation or just StorageLocation.
          DiskId - StorageId seems appropriate here. However, it is used for other things in HDFS. As you suggested, perhaps VolumeId may be okay.

          Should I just bump the default (say, to 10)? I haven't done any performance testing, so I don't know if it's a problem.

          With this feature there will be more RPC calls to datanodes, and hence we may need more handlers. A handler is just a thread, so increasing the count to 10 should be fine.

          @aaron - we need the server-side config as well. That is the only way an admin can control access to the feature. On the client side, one could rely on an exception (or a check for whether the required method is supported) to figure out whether the server supports the functionality, instead of a config.

          Please address my previous comment:

          Is there a timeline where someone will work on HBase or MapReduce enhancements to use this capability?

          Andrew Purtell added a comment -

          Is there a timeline where someone will work on HBase or MapReduce enhancements to use this capability?

          I put up some ramblings on HBASE-6572. The scope is much larger and there's no timeline; it's a brainstorming issue. However, if you'd like, this issue can be linked to it.

          Andrew Wang added a comment -

          Thanks everyone for all your input! Here's another spin of the patch. Big things:

          • I renamed the Disk* classes to BlockStorageLocation and VolumeId, and tried to update all the javadoc/comments.
          • I split out most of the DFSClient code into a new BlockStorageLocationUtil class, which is ~300 lines of static methods. I pulled apart one of the long methods. Doing this for the other long method would arguably be messier, so I left it.
          • Added the DN-side config option. If any of the DNs throws an UnsupportedOperationException, it's bubbled up to the client (thus failing the entire call). The client-side code also checks for the same DN config option, so you need to enable it in both the client and DN for this to do anything.
          • Bumped the DN handler count to 10.

          I think Suresh's other more minor comments are also addressed.
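
          A rough sketch of the DN-side gating described in the third bullet; the config key and method shape here are illustrative, not the names used in the patch:

            import java.util.Collections;
            import java.util.List;

            import org.apache.hadoop.conf.Configuration;

            public class MetadataGateExample {
              private final boolean enabled;

              public MetadataGateExample(Configuration conf) {
                // Hypothetical key; disabled unless the admin turns it on.
                this.enabled =
                    conf.getBoolean("dfs.datanode.block-metadata.enabled", false);
              }

              public List<byte[]> getHdfsBlocksMetadata(List<String> blockIds) {
                if (!enabled) {
                  // Propagated back to the client, which fails the whole call.
                  throw new UnsupportedOperationException(
                      "Datanode block-metadata queries are disabled by configuration");
                }
                // ... map each requested block to the opaque id of its volume ...
                return Collections.emptyList();
              }
            }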

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12540815/hdfs-3672-9.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.namenode.TestFsck

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/3001//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/3001//console

          This message is automatically generated.

          Andrew Wang added a comment -

          Ran TestFsck locally and it passed; I think the test failures are unrelated.

          Aaron T. Myers added a comment -

          The latest patch looks good to me, and I believe it addresses all of Suresh's feedback.

          Suresh, do you have any more comments on the latest patch?

          Suresh Srinivas added a comment -

          I put up some ramblings on HBASE-6572. The scope is much larger and there's no timeline, it's a brainstorming issue. However, if you'd like this issue can be linked to it.

          Andrew, I am not sure how this jira and the solution it is providing help HBASE-6572. Some of the intent of HBASE-6572 is why I think the current temporary hack is the wrong way to go about the solution. See my comments above here and here

          Andrew Purtell added a comment -

          @Suresh, thanks for linking HBASE-6572 to HDFS-2832, I missed that issue. That's a better issue linkage.

          If HDFS is to support heterogeneous/tiered storage, then somehow the NNs and DNs must negotiate block placement by policy. For example, suppose the NN is doing some kind of path based mapping of files->blocks->device type. Say the default is disk. Now the user updates the policy for a subtree of the namespace to solid state. For any new file in that subtree the NN would presumably pass a hint to the DFSClient and the DFSClient would in turn pass the hint to the DNs: place block on the desired media type or fail. For any existing file in the subtree, the NN would need to migrate blocks from one storage tier to another. Presumably the DN must include in block reports the "disk location" including the media type so the NN has the necessary information to accomplish that. Simply exposing that "disk location" information via an API is the intent of this issue, right? Scratching one itch here can be leveraged as incremental development toward a larger goal? Happy to take this discussion to HDFS-2832 or offline or simply drop it if a distraction or in error.
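
          None of the following exists in HDFS; purely as a thought experiment, the hint flow described above might look something like this, with every name invented for illustration:

            public class PlacementHintSketch {
              enum MediaType { DISK, SSD }

              /** Hint the NN could hand to the DFSClient, and the client to each DN. */
              static final class PlacementHint {
                final MediaType desired;
                PlacementHint(MediaType desired) { this.desired = desired; }
              }

              /** A DN would accept the block only if it can satisfy the hint, else fail. */
              static boolean canPlace(PlacementHint hint, MediaType availableOnVolume) {
                return hint == null || hint.desired == availableOnVolume;
              }
            }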

          Suresh Srinivas added a comment -

          Thanks Andrew. HDFS-2832 can benefit a lot from the requirements from HBase. Planning to start working on it. Will ping you to get use cases and requirements.

          Suresh Srinivas added a comment -

          @Aaron, want to make sure we are on the same page. This is a temporary solution. I am not going to block committing this change. When we make enough progress in HDFS-2832, I plan to remove this functionality.

          Aaron T. Myers added a comment -

          @Aaron, want to make sure we are on the same page. This is a temporary solution. I am not going to block committing this change. When we make enough progress in HDFS-2832, I plan to remove this functionality.

          To be clear, you mean that you're going to implement this differently, right? The thing that I care about is that clients that wish to have access to this information can get to it. I agree that once HDFS-2832 is implemented, this information will be available in the NN, so we won't need to do the RPC fanout to the DNs, but the user API will remain.

          Suresh Srinivas added a comment -

          but the user API will remain.

          There will be equivalent functionality. The user API is marked unstable and may be removed depending on how HDFS-2832 implements this functionality. Not clear right now.

          Aaron T. Myers added a comment -

          Cool. Sounds like we're on the same page then.

          If there are no more comments, I'm going to go ahead and commit this later today.

          Andrew Wang added a comment -

          Rebase patch on trunk.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12541273/hdfs-3672-10.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          -1 javac. The patch appears to cause the build to fail.

          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/3031//console

          This message is automatically generated.

          Andrew Wang added a comment -

          Rebase try 2. A lesson about compile testing after a rebase has been learned.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12541275/hdfs-3672-11.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs:

          org.apache.hadoop.hdfs.server.namenode.TestFsck
          org.apache.hadoop.hdfs.server.namenode.TestCheckpoint

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/3032//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/3032//console

          This message is automatically generated.

          Andrew Wang added a comment -

          Ran the failed tests locally, and they passed. I think the failures are unrelated.

          Aaron T. Myers added a comment -

          The latest patch looks pretty good to me, and I agree that the test failures seem unrelated. One small comment:

          It seems reasonable to hard-code "false" for the useHostname parameter in calls to DatanodeInfo#getIpcAddr where the return value is only used in a log message. In the call to DFSUtil#createClientDatanodeProtocolProxy, however, you should use the value of DFSClient.Conf.connectToDnViaHostname, so that this patch stays in keeping with the changes introduced by HDFS-3150.

          +1 once this is addressed.

          Andrew Wang added a comment -

          My bad; I should have followed that hostname change more closely. The patch now passes the conf parameter down so the RPC threads obey it properly.
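
          For reference, the knob being honoured is the client-side hostname switch from HDFS-3150. A minimal sketch of opting in on the client, assuming the dfs.client.use.datanode.hostname key (the exact constant in DFSConfigKeys may differ):

          {code:java}
          import org.apache.hadoop.conf.Configuration;
          import org.apache.hadoop.fs.FileSystem;

          public class HostnameOptIn {
            public static void main(String[] args) throws Exception {
              Configuration conf = new Configuration();
              // Key from HDFS-3150: when true, the client (including the
              // parallel datanode RPCs issued for volume ids) connects to
              // datanodes by hostname rather than IP.
              conf.setBoolean("dfs.client.use.datanode.hostname", true);
              FileSystem fs = FileSystem.get(conf);
              System.out.println("Using filesystem at " + fs.getUri());
              fs.close();
            }
          }
          {code}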

          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12541314/hdfs-3672-12.patch
          against trunk revision .

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 2 new or modified test files.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 eclipse:eclipse. The patch built with eclipse:eclipse.

          +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/3036//testReport/
          Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/3036//console

          This message is automatically generated.

          Aaron T. Myers added a comment -

          Thanks for making that change, Andrew.

          +1, the latest patch looks good to me. I'm going to commit this momentarily.

          Aaron T. Myers added a comment -

          I've just committed this to trunk and branch-2. Thanks a lot for the contribution, Andrew, and thanks to Suresh, Arun, et al. for the discussion.

          Hudson added a comment -

          Integrated in Hadoop-Common-trunk-Commit #2598 (See https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2598/)
          HDFS-3672. Expose disk-location information for blocks to enable better scheduling. Contributed by Andrew Wang. (Revision 1374355)

          Result = SUCCESS
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1374355
          Files :

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/BlockStorageLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsBlockLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsVolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/VolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/BlockStorageLocationUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/ClientDatanodeProtocol.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/HdfsBlocksMetadata.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolServerSideTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/FsDatasetSpi.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/FsDatasetImpl.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/proto/ClientDatanodeProtocol.proto
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/SimulatedFSDataset.java
          Hudson added a comment -

          Integrated in Hadoop-Hdfs-trunk-Commit #2662 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2662/)
          HDFS-3672. Expose disk-location information for blocks to enable better scheduling. Contributed by Andrew Wang. (Revision 1374355)

          Result = SUCCESS
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1374355
          Files :

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/BlockStorageLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsBlockLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsVolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/VolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/BlockStorageLocationUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/ClientDatanodeProtocol.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/HdfsBlocksMetadata.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolServerSideTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/FsDatasetSpi.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/FsDatasetImpl.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/proto/ClientDatanodeProtocol.proto
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/SimulatedFSDataset.java
          Hudson added a comment -

          Integrated in Hadoop-Mapreduce-trunk-Commit #2627 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2627/)
          HDFS-3672. Expose disk-location information for blocks to enable better scheduling. Contributed by Andrew Wang. (Revision 1374355)

          Result = FAILURE
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1374355
          Files :

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/BlockStorageLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsBlockLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsVolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/VolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/BlockStorageLocationUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/ClientDatanodeProtocol.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/HdfsBlocksMetadata.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolServerSideTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/FsDatasetSpi.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/FsDatasetImpl.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/proto/ClientDatanodeProtocol.proto
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/SimulatedFSDataset.java
          Hudson added a comment -

          Integrated in Hadoop-Hdfs-trunk #1138 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk/1138/)
          HDFS-3672. Expose disk-location information for blocks to enable better scheduling. Contributed by Andrew Wang. (Revision 1374355)

          Result = FAILURE
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1374355
          Files :

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/BlockStorageLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsBlockLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsVolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/VolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/BlockStorageLocationUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/ClientDatanodeProtocol.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/HdfsBlocksMetadata.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolServerSideTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/FsDatasetSpi.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/FsDatasetImpl.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/proto/ClientDatanodeProtocol.proto
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/SimulatedFSDataset.java
          Hudson added a comment -

          Integrated in Hadoop-Mapreduce-trunk #1170 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1170/)
          HDFS-3672. Expose disk-location information for blocks to enable better scheduling. Contributed by Andrew Wang. (Revision 1374355)

          Result = FAILURE
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1374355
          Files :

          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/BlockStorageLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsBlockLocation.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/HdfsVolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/fs/VolumeId.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/BlockStorageLocationUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSClient.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSConfigKeys.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSUtil.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/ClientDatanodeProtocol.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocol/HdfsBlocksMetadata.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolServerSideTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/protocolPB/ClientDatanodeProtocolTranslatorPB.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DataNode.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/FsDatasetSpi.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/FsDatasetImpl.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/proto/ClientDatanodeProtocol.proto
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/resources/hdfs-default.xml
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestDistributedFileSystem.java
          • /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/SimulatedFSDataset.java
          Hudson added a comment -

          Integrated in Hadoop-Hdfs-trunk-Commit #2688 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2688/)
          MAPREDUCE-4577. HDFS-3672 broke TestCombineFileInputFormat.testMissingBlocks() test. Contributed by Aaron T. Myers. (Revision 1376297)

          Result = SUCCESS
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1376297
          Files :

          • /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
          • /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/lib/input/TestCombineFileInputFormat.java
          Hudson added a comment -

          Integrated in Hadoop-Common-trunk-Commit #2624 (See https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2624/)
          MAPREDUCE-4577. HDFS-3672 broke TestCombineFileInputFormat.testMissingBlocks() test. Contributed by Aaron T. Myers. (Revision 1376297)

          Result = SUCCESS
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1376297
          Files :

          • /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
          • /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/lib/input/TestCombineFileInputFormat.java
          Hudson added a comment -

          Integrated in Hadoop-Mapreduce-trunk-Commit #2652 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2652/)
          MAPREDUCE-4577. HDFS-3672 broke TestCombineFileInputFormat.testMissingBlocks() test. Contributed by Aaron T. Myers. (Revision 1376297)

          Result = FAILURE
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1376297
          Files :

          • /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
          • /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/lib/input/TestCombineFileInputFormat.java
          Hudson added a comment -

          Integrated in Hadoop-Hdfs-trunk #1143 (See https://builds.apache.org/job/Hadoop-Hdfs-trunk/1143/)
          MAPREDUCE-4577. HDFS-3672 broke TestCombineFileInputFormat.testMissingBlocks() test. Contributed by Aaron T. Myers. (Revision 1376297)

          Result = FAILURE
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1376297
          Files :

          • /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
          • /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/lib/input/TestCombineFileInputFormat.java
          Hudson added a comment -

          Integrated in Hadoop-Mapreduce-trunk #1175 (See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1175/)
          MAPREDUCE-4577. HDFS-3672 broke TestCombineFileInputFormat.testMissingBlocks() test. Contributed by Aaron T. Myers. (Revision 1376297)

          Result = FAILURE
          atm : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1376297
          Files :

          • /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
          • /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/lib/input/TestCombineFileInputFormat.java

            People

            • Assignee: Andrew Wang
            • Reporter: Andrew Wang
            • Votes: 0
            • Watchers: 33
