Details
- Type: Bug
- Status: Closed
- Priority: Major
- Resolution: Duplicate
Description
Faulty container crash detection code path in NodeManager.
Steps:
1. Run a job.
2. Kill the AM container on the NM.
The NM log then shows:
org.apache.hadoop.yarn.state.InvalidStateTransitonException: Invalid event: CONTAINER_KILLED_ON_REQUEST at RUNNING
at org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:297)
at org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:39)
at org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:439)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl.handle(ContainerImpl.java:685)
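For readers unfamiliar with table-driven state machines, the following is a minimal, self-contained sketch of how an error like the one above can arise. It is not Hadoop's StateMachineFactory or ContainerImpl code. The class TransitionSketch, the table layout, and the KILL_CONTAINER, KILLING and DONE names are assumptions made for illustration; only RUNNING and CONTAINER_KILLED_ON_REQUEST are taken from the log. The point is that an event delivered in a state with no registered transition for it is rejected with an exception.

```java
import java.util.EnumMap;
import java.util.Map;

// Illustrative stand-in for a table-driven state machine; NOT Hadoop's implementation.
public class TransitionSketch {
    // RUNNING comes from the log; KILLING and DONE are assumed for the sketch.
    enum State { RUNNING, KILLING, DONE }

    // CONTAINER_KILLED_ON_REQUEST comes from the log; KILL_CONTAINER is assumed.
    enum Event { KILL_CONTAINER, CONTAINER_KILLED_ON_REQUEST }

    // Transition table: current state -> (event -> next state).
    private static final Map<State, Map<Event, State>> TABLE =
            new EnumMap<State, Map<Event, State>>(State.class);

    static {
        register(State.RUNNING, Event.KILL_CONTAINER, State.KILLING);
        register(State.KILLING, Event.CONTAINER_KILLED_ON_REQUEST, State.DONE);
        // Deliberately no entry for (RUNNING, CONTAINER_KILLED_ON_REQUEST).
    }

    private static void register(State from, Event on, State to) {
        TABLE.computeIfAbsent(from, k -> new EnumMap<Event, State>(Event.class)).put(on, to);
    }

    // Looks up the next state; rejects events that have no registered transition.
    static State doTransition(State current, Event event) {
        Map<Event, State> row = TABLE.get(current);
        if (row == null || !row.containsKey(event)) {
            throw new IllegalStateException("Invalid event: " + event + " at " + current);
        }
        return row.get(event);
    }

    public static void main(String[] args) {
        // A registered transition succeeds.
        System.out.println(doTransition(State.RUNNING, Event.KILL_CONTAINER));
        // An unregistered (state, event) pair is rejected.
        try {
            doTransition(State.RUNNING, Event.CONTAINER_KILLED_ON_REQUEST);
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

In this sketch the second call fails because the event arrives while the machine is still in RUNNING. Whether the real fix is to register an additional transition or to change which event is raised is not stated in this report.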
Issue Links
- is duplicated by: MAPREDUCE-3031 Job Client goes into infinite loop when we kill AM (Closed)