Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
Reviewed
Description
As proposed in YARN-914, a new state of “DECOMMISSIONING” will be added and can transition from “running” state triggered by a new event - “decommissioning”.
This new state can be transit to state of “decommissioned” when Resource_Update if no running apps on this NM or NM reconnect after restart. Or it received DECOMMISSIONED event (after timeout from CLI).
In addition, it can back to “running” if user decides to cancel previous decommission by calling recommission on the same node. The reaction to other events is similar to RUNNING state.
Attachments
Attachments
Issue Links
- blocks
-
YARN-3223 Resource update during NM graceful decommission
- Resolved
- depends upon
-
YARN-3225 New parameter or CLI for decommissioning node gracefully in RMAdmin CLI
- Resolved
-
YARN-3445 Cache runningApps in RMNode for getting running apps on given NodeId
- Resolved
- is depended upon by
-
YARN-3224 Notify AM with containers (on decommissioning node) could be preempted after timeout.
- Resolved
- is duplicated by
-
YARN-670 Add an Exception to indicate 'Maintenance' for NMs and add this to the JavaDoc for appropriate protocols
- Resolved