Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
2.6.1
-
None
-
None
-
Incompatible change, Reviewed
-
Description
Even with MAPREDUCE-5817, there could still be cases where mappers get scheduled before all reducers are complete, but those mappers run for long time, even after all reducers are complete. This could hurt the performance of large MR jobs.
In some cases, mappers don't have any materialize-able outcome other than providing intermediate data to reducers. In that case, the job owner should have the config option to finish the job once all reducers are complete.
Attachments
Attachments
Issue Links
- is related to
-
MAPREDUCE-6937 Backport MAPREDUCE-6870 to branch-2 while preserving compatibility
- Resolved
- relates to
-
MAPREDUCE-6939 Follow-up on MAPREDUCE-6870
- Open
-
MAPREDUCE-5817 Mappers get rescheduled on node transition even after all reducers are completed
- Closed