Hadoop YARN / YARN-8472 YARN Container Phase 2 / YARN-7644

NM gets backed up deleting docker containers

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.2.0
    • Component/s: nodemanager
    • Hadoop Flags: Reviewed

    Description

      When we shut down a container, we send a docker stop to the docker container with a timeout of 10 seconds. If the container has not stopped after 10 seconds, we force kill it. However, docker stop is a blocking call, so when many containers do not go down on the initial SIGTERM, we have to wait 10+ seconds for each docker stop to return. This ties up the ContainerLaunch handler, so these kill events back up. It also appears to back up new container launches.
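
The backlog described above can be illustrated with a small simulation. This is only a sketch and is not taken from the attached patches: the class `StopBacklogSketch`, the helpers `blockingStop` and `drainMillis`, and the 50 ms delay (standing in for the 10 second docker stop timeout) are all hypothetical. It compares draining a batch of blocking stop calls on a single thread, as a lone event handler would, against draining the same batch on a small worker pool.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

public class StopBacklogSketch {

  // Hypothetical stand-in for a blocking "docker stop" on a container
  // that ignores SIGTERM: the caller is stuck for the whole timeout.
  static void blockingStop(long stopMillis) {
    try {
      Thread.sleep(stopMillis);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
    }
  }

  // Submits one blocking stop per container to a pool with the given number
  // of threads and returns the elapsed milliseconds until all have returned.
  static long drainMillis(int containers, long stopMillis, int threads)
      throws Exception {
    ExecutorService pool = Executors.newFixedThreadPool(threads);
    long start = System.nanoTime();
    try {
      List<Future<?>> pending = new ArrayList<>();
      for (int i = 0; i < containers; i++) {
        pending.add(pool.submit(() -> blockingStop(stopMillis)));
      }
      for (Future<?> f : pending) {
        f.get(); // wait for each simulated stop to return
      }
    } finally {
      pool.shutdown();
    }
    return TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start);
  }

  public static void main(String[] args) throws Exception {
    // One thread: every stop waits behind the previous one.
    System.out.println("single handler thread: " + drainMillis(4, 50, 1) + " ms");
    // Four threads: the stops overlap instead of queueing.
    System.out.println("four worker threads:   " + drainMillis(4, 50, 4) + " ms");
  }
}
```

With one thread the total wait grows linearly with the number of containers that ignore SIGTERM, which is the queueing effect reported here. Whether the actual fix moves the stop off the handler thread or takes another approach is not stated in this description.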

      Attachments

        1. YARN-7644.001.patch (31 kB, Chandni Singh)
        2. YARN-7644.002.patch (32 kB, Chandni Singh)
        3. YARN-7644.003.patch (33 kB, Chandni Singh)
        4. YARN-7644.004.patch (33 kB, Chandni Singh)
        5. YARN-7644.005.patch (33 kB, Chandni Singh)
        6. YARN-7644.006.patch (33 kB, Chandni Singh)

            People

              Assignee: csingh Chandni Singh
              Reporter: ebadger Eric Badger
              Votes: 0
              Watchers: 7
