[MAPREDUCE-2636] Scheduling over disks horizontally - ASF JIRA

Details

Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: job submission
Labels: None

Description

Based on this message: http://mail-archives.apache.org/mod_mbox/hadoop-hdfs-user/201106.mbox/browser

The JobTracker (JT) schedules tasks on nodes based on the block-location metadata it gets from the NameNode (NN). The NameNode knows which nodes hold a block, but not which disk on a node the block resides on. As a result, on a node running 4 tasks, all 4 may end up reading from the same disk, and that contention can hurt performance.

One possible optimization is to schedule horizontally over disks rather than over nodes, so that tasks running on the same node read from different disks where possible. Any ideas?
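
As a rough illustration of the idea, the following Java sketch fills a node's free task slots by rotating over disks instead of taking tasks in arrival order. It assumes the scheduler can somehow learn which local disk holds each task's input block, which the NameNode does not report today. `DiskAwarePicker`, `PendingTask`, and `pick` are hypothetical names for this sketch, not Hadoop APIs.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: not part of the JobTracker or TaskTracker code.
final class DiskAwarePicker {

    // A pending task plus the local disk that holds its input block.
    // The disk id is an assumption: the NameNode does not expose it.
    static final class PendingTask {
        final String taskId;
        final String diskId;

        PendingTask(String taskId, String diskId) {
            this.taskId = taskId;
            this.diskId = diskId;
        }
    }

    // Picks up to `slots` tasks, taking one task per disk in turn,
    // so that concurrent readers are spread across disks.
    static List<String> pick(List<PendingTask> pending, int slots) {
        // Group task ids by disk, keeping first-seen disk order and task order.
        Map<String, Deque<String>> byDisk = new LinkedHashMap<>();
        for (PendingTask t : pending) {
            byDisk.computeIfAbsent(t.diskId, k -> new ArrayDeque<>()).add(t.taskId);
        }

        List<String> chosen = new ArrayList<>();
        boolean progressed = true;
        // Keep making passes over the disks until the slots are full
        // or no disk has any task left.
        while (chosen.size() < slots && progressed) {
            progressed = false;
            for (Deque<String> queue : byDisk.values()) {
                if (chosen.size() == slots) {
                    break;
                }
                String next = queue.poll();
                if (next != null) {
                    chosen.add(next);
                    progressed = true;
                }
            }
        }
        return chosen;
    }

    public static void main(String[] args) {
        List<PendingTask> pending = new ArrayList<>();
        pending.add(new PendingTask("t1", "disk0"));
        pending.add(new PendingTask("t2", "disk0"));
        pending.add(new PendingTask("t3", "disk0"));
        pending.add(new PendingTask("t4", "disk0"));
        pending.add(new PendingTask("t5", "disk1"));
        pending.add(new PendingTask("t6", "disk2"));
        pending.add(new PendingTask("t7", "disk3"));
        // Four slots, four disks: one reader per disk.
        System.out.println(pick(pending, 4)); // prints [t1, t5, t6, t7]
    }
}
```

In this example, taking tasks in arrival order would start t1 through t4, all reading from disk0. The rotation starts one task per disk instead. When there are more free slots than disks with pending tasks, the sketch makes further passes, so some disks still serve more than one reader.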

People

Assignee: Unassigned

Reporter: Evert Lammerts

Votes: 0

Watchers: 9

Dates

Created: 01/Jul/11 08:29

Updated: 07/Jan/13 22:12