Details
Description
Consider the following scenario:
1. RM assigns a container on node N to an app A.
2. Node N is restarted
3. A tries to launch container on node N.
3 could lead to an NMNotYetReadyException depending on whether NM N has registered with the RM. In MR, this is considered a task attempt failure. A few of these could lead to a task/job failure.
Attachments
Attachments
Issue Links
- relates to
-
YARN-3839 Quit throwing NMNotYetReadyException
- Resolved