Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
None
-
None
-
None
Description
A "failed" app attempt is one that failed due to an error in the user program, as opposed to one that was "killed" by the system. Like in MapReduce task attempts, we should distinguish the two so that killed attempts do not count against the number of retries (yarn.resourcemanager.am.max-retries).
Attachments
Issue Links
- is related to
-
YARN-128 [Umbrella] RM Restart Phase 1: State storage and non-work-preserving recovery
- Resolved