Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-9192

Deletion Taks will be picked up to delete running containers

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.9.1
    • Fix Version/s: None
    • Component/s: applications
    • Labels:
      None

      Description

      I suspect there is a bug in Yarn deletion task service, below is my repo steps:

      1. First let's set yarn.nodemanager.delete.debug-delay-sec=3600, that means when the app finished, the Binary/container folder will be deleted after 3600 seconds.
      2. when the application App1 (long running service) is running on machine machine1, and machine1 shutdown, ContainerManagerImpl#serviceStop() will be called -> ContainerManagerImpl#cleanUpApplicationsOnNMShutDown, and ApplicationFinishEvent will be sent, and then some delection tasks will be created, but be stored in DB and will be picked up to execute 3600 seconds.
      3. 100 seconds later, machine1 comes back, and the same app is assigned to run this this machine, container created and works well.
      4. then deleting task created in step 2 will be picked up to delete containers created in step 3 later.

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              sihai Sihai Ke
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated: