Details
- Improvement
- Status: Closed
- Major
- Resolution: Duplicate
- 0.14.3
- None
- None
- None
- All
Description
Even when mapred.local.dir specifies a comma-separated list of partitions (typically one per physical disk), all tasks of the same job get current working directories on a single partition. For side-effect tasks that use the local cwd as scratch space, this overloads one disk while the other disks may sit idle. Ideally, each task should get a cwd on a different partition. This is related to HADOOP-1991, but emphasizes the performance impact.
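To illustrate the requested behavior, here is a minimal sketch of round-robin cwd assignment across the partitions listed in mapred.local.dir. It is not Hadoop's actual implementation: the class name LocalDirPicker, the method cwdFor, and the directory layout under each partition are all hypothetical, chosen only to show the idea.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Hadoop: spreads task working
// directories across the partitions named in mapred.local.dir.
public class LocalDirPicker {
    private final String[] localDirs;

    // commaSeparatedDirs is the raw value of mapred.local.dir,
    // e.g. "/disk1/mapred,/disk2/mapred".
    public LocalDirPicker(String commaSeparatedDirs) {
        List<String> dirs = new ArrayList<>();
        for (String dir : commaSeparatedDirs.split(",")) {
            String trimmed = dir.trim();
            if (!trimmed.isEmpty()) {
                dirs.add(trimmed);
            }
        }
        if (dirs.isEmpty()) {
            throw new IllegalArgumentException("no local dirs configured");
        }
        this.localDirs = dirs.toArray(new String[0]);
    }

    // Chooses a partition by task index (round-robin), so consecutive
    // tasks of one job land on different disks. The path layout below
    // the partition is an assumption made for this sketch.
    public String cwdFor(String jobId, int taskIndex) {
        String base = localDirs[Math.floorMod(taskIndex, localDirs.length)];
        return base + "/" + jobId + "/task_" + taskIndex + "/work";
    }
}
```

With three partitions configured, tasks 0, 1 and 2 would each get a cwd on a different disk, and task 3 would wrap around to the first. A real fix would likely also need to consider free space on each partition, which this sketch ignores.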
Issue Links
- is part of: HADOOP-2227 wrong usage of mapred.local.dir.minspacestart (Closed)