Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-11421

Graceful Decommission ignores launched containers and gets deactivated before timeout

    XMLWordPrintableJSON

Details

    Description

      During Graceful Decommission, a Node gets deactivated before timeout even though there are launched containers on that node.

      We have observed cases when graceful decommission signal is sent to node and Containers are launched at NodeManager and at the same time, in such cases ResourceManager moves the node from Decommissioning to Decommissioned state because launced containers are not checked in DecommissioningNodesWatcher.

      We will suggest waiting for yarn.resourcemanager.decommissioning-nodes-watcher.delay-ms to complete before marking node ready to be decommissioned. No delay if set to 0. Expire interval should not be configured more than RM_AM_EXPIRY_INTERVAL_MS.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              abhishekd0907 Abhishek Dixit
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: