Details
- Type: Bug
- Status: Closed
- Priority: Major
- Resolution: Fixed
- Reviewed
Description
In ContainerImpl.getLocalizedResources(), there's:
assert ContainerState.LOCALIZED == getContainerState(); // TODO: FIXME!!
ContainerImpl.getLocalizedResources() is called from ContainerLaunch.call(), which runs on a separate thread. If the container is not in the LOCALIZED state (e.g. it is in KILLING, see YARN-906), an AssertionError is thrown and the thread fails without notifying the NM. As a result, the container never receives the further events that ContainerLaunch.call() is supposed to send, and it cannot move towards completion.
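The failure mode can be reproduced outside YARN with a minimal sketch. Everything here is hypothetical and only stands in for the real classes: FakeContainer plays the role of ContainerImpl, the submitted task plays the role of ContainerLaunch.call(), and an AtomicBoolean stands in for the follow-up event. The sketch throws AssertionError explicitly so that it behaves the same whether or not the JVM runs with -ea; a real assert only fires when assertions are enabled.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.atomic.AtomicBoolean;

public class SilentAssertionDemo {
    // Hypothetical subset of container states, for illustration only.
    enum State { LOCALIZED, KILLING }

    // Hypothetical stand-in for ContainerImpl.
    static class FakeContainer {
        private final State state;
        FakeContainer(State state) { this.state = state; }

        String getLocalizedResources() {
            // Explicit throw mirrors what the assert does under -ea.
            if (state != State.LOCALIZED) {
                throw new AssertionError("unexpected state: " + state);
            }
            return "resources";
        }
    }

    // What the caller can observe after the task has run.
    static class Outcome {
        final boolean eventSent;   // did the task reach the "send event" step?
        final Throwable failure;   // error captured by the Future, or null
        Outcome(boolean eventSent, Throwable failure) {
            this.eventSent = eventSent;
            this.failure = failure;
        }
    }

    static Outcome launch(State state) {
        FakeContainer container = new FakeContainer(state);
        AtomicBoolean eventSent = new AtomicBoolean(false);

        // Stand-in for ContainerLaunch.call(): read resources, then notify.
        Callable<Void> task = () -> {
            container.getLocalizedResources();
            eventSent.set(true); // never reached if the line above throws
            return null;
        };

        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            Future<Void> future = pool.submit(task);
            Throwable failure = null;
            try {
                future.get(5, TimeUnit.SECONDS);
            } catch (ExecutionException e) {
                // The error is stored in the Future; it is visible only
                // to code that calls get() and inspects the cause.
                failure = e.getCause();
            }
            return new Outcome(eventSent.get(), failure);
        } catch (InterruptedException | TimeoutException e) {
            throw new IllegalStateException(e);
        } finally {
            pool.shutdownNow();
        }
    }

    public static void main(String[] args) {
        for (State s : State.values()) {
            Outcome o = launch(s);
            System.out.println(s + ": eventSent=" + o.eventSent
                    + ", failure=" + o.failure);
        }
    }
}
```

In the KILLING case the task dies before the notification step, and the error surfaces only because launch() calls Future.get(). If nothing inspects the Future, the failure is silent, which matches the behaviour described above.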
Attachments
Issue Links
- relates to: YARN-906 Cancelling ContainerLaunch#call at KILLING causes that the container cannot be completed (Closed)