[MAPREDUCE-68] Hadoop reduce scheduler sometimes leaves machines idle - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Not A Problem
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels: None

Description

I have a MapReduce application in which the number of reducers equals the number of machines in the cluster, with speculative execution turned off. However, Hadoop schedules multiple reduces on some machines and leaves other machines idle. This causes contention and seriously slows down the job. Hadoop should employ the simple heuristic of using as many machines as possible when scheduling reduces.
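
To make the requested heuristic concrete, here is a minimal sketch of a "spread first" placement policy. It is not Hadoop's scheduler code and does not use any Hadoop API; the function name `spread_reduces`, the `slots_per_machine` parameter, and the machine names are all hypothetical and exist only for illustration. Each reduce is placed on whichever machine currently runs the fewest reduces, so no machine gets a second reduce while another is still idle.

```python
import heapq


def spread_reduces(num_reduces, machines, slots_per_machine=2):
    """Assign reduce task ids to machines, least-loaded machine first.

    Hypothetical illustration only: this models the heuristic from the
    description, not the behavior of any real Hadoop scheduler.
    """
    # Min-heap of (current load, tie-break index, machine name).
    heap = [(0, i, m) for i, m in enumerate(machines)]
    heapq.heapify(heap)
    assignment = {m: [] for m in machines}

    for task_id in range(num_reduces):
        if not heap:
            break  # every slot is taken; remaining reduces would have to wait
        load, idx, machine = heapq.heappop(heap)
        assignment[machine].append(task_id)
        # Put the machine back only while it still has a free slot.
        if load + 1 < slots_per_machine:
            heapq.heappush(heap, (load + 1, idx, machine))

    return assignment
```

With four reduces and four machines, `spread_reduces(4, ["m1", "m2", "m3", "m4"])` places exactly one reduce on each machine, which is the outcome the description asks for.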

People

Assignee: Unassigned

Reporter: Nathan Marz

Votes: 0

Watchers: 7

Dates

Created: 03/Feb/09 03:22

Updated: 31/Dec/11 09:53

Resolved: 31/Dec/11 09:53