[YARN-5465] Server-Side NM Graceful Decommissioning subsequent call behavior - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: graceful
Labels:
None

Target Version/s:

3.5.0

Description

The Server-Side NM Graceful Decommissioning feature added by ~~YARN-4676~~ has the following behavior when subsequent calls are made:

Start a long-running job that has containers running on nodeA
Add nodeA to the exclude file
Run -refreshNodes -g 120 -server (2min) to begin gracefully decommissioning nodeA
Wait 30 seconds
Add nodeB to the exclude file
Run -refreshNodes -g 30 -server (30sec)
After 30 seconds, both nodeA and nodeB shut down

In a nutshell, issuing a subsequent call to gracefully decommission nodes updates the timeout for any currently decommissioning nodes. This makes it impossible to gracefully decommission different sets of nodes with different timeouts. Though it does let you easily update the timeout of currently decommissioning nodes.

Another behavior we could do is this:

Start a long-running job that has containers running on nodeA
# Add nodeA to the exclude file
Run -refreshNodes -g 120 -server (2min) to begin gracefully decommissioning nodeA
Wait 30 seconds
Add nodeB to the exclude file
Run -refreshNodes -g 30 -server (30sec)
After 30 seconds, nodeB shuts down
After 60 more seconds, nodeA shuts down

This keeps the nodes affected by each call to gracefully decommission nodes independent. You can now have different sets of decommissioning nodes with different timeouts. However, to update the timeout of a currently decommissioning node, you'd have to first recommission it, and then decommission it again.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Robert Kanter

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 02/Aug/16 16:40

Updated:: 04/Jan/24 08:57