Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
2.0.2-alpha
-
None
-
Reviewed
Description
When YARN reports a completed container to the MR AM, it always interprets it as a failure. This can lead to a job failing because too many of its tasks failed, when in fact they only failed because the scheduler preempted them.
MR needs to recognize the special exit code value of -100 and interpret it as a container being killed instead of a container failure.
Attachments
Attachments
Issue Links
- is depended upon by
-
MAPREDUCE-5900 Container preemption interpreted as task failures and eventually job failures
- Closed
- is related to
-
YARN-352 Inconsistent picture of how a container was killed when querying RM and NM in case of preemption
- Open
-
MAPREDUCE-4788 Job are marking as FAILED even if there are no failed tasks in it
- Open