Hadoop YARN / YARN-7239

Possible launch/cleanup race condition in ContainersLauncher

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Component/s: nodemanager

    Description

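      ContainersLauncher.handle() submits the launch job first and only afterwards adds it to the running collection. If a cleanup request arrives between these two steps, it does not find the container in the collection and returns without doing anything. The two statements should be in reverse order in all 3 instances: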

              containerLauncher.submit(launch);
              running.put(containerId, launch);
      

      The cleanup code that the above code is racing with:

              ContainerLaunch runningContainer = running.get(containerId);
              if (runningContainer == null) {
                // Container not launched. So nothing needs to be done.
                LOG.info("Container " + containerId + " not running, nothing to signal.");
                return;
              }
      ...
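
      The ordering problem can be reproduced deterministically outside of YARN. The following is a minimal sketch, not the NodeManager code: the class LaunchOrderSketch, its methods, and the raceWindow hook are hypothetical names introduced for illustration, and only the running map and the containerLauncher executor mirror identifiers from the snippets above. It forces a cleanup to run in the gap between the two statements and reports whether the cleanup found the container.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical, simplified model of the launch/cleanup ordering (not YARN code).
final class LaunchOrderSketch {
    // Stand-ins for the "running" collection and the launcher executor.
    private final Map<String, Runnable> running = new ConcurrentHashMap<>();
    private final ExecutorService containerLauncher = Executors.newSingleThreadExecutor();

    // Mirrors the cleanup lookup: returns false when the container is not registered.
    boolean cleanup(String containerId) {
        Runnable runningContainer = running.get(containerId);
        if (runningContainer == null) {
            return false; // "not running, nothing to signal"
        }
        running.remove(containerId);
        return true;
    }

    // Order described in the report; raceWindow runs in the gap between the two steps.
    void submitThenPut(String containerId, Runnable launch, Runnable raceWindow) {
        containerLauncher.submit(launch);
        raceWindow.run();
        running.put(containerId, launch);
    }

    // Proposed order: register first, then submit.
    void putThenSubmit(String containerId, Runnable launch, Runnable raceWindow) {
        running.put(containerId, launch);
        raceWindow.run();
        containerLauncher.submit(launch);
    }

    void shutdown() {
        containerLauncher.shutdown();
    }

    // Runs one launch with a cleanup forced into the gap; returns whether cleanup saw the container.
    static boolean cleanupFindsContainer(boolean registerFirst) {
        LaunchOrderSketch sketch = new LaunchOrderSketch();
        boolean[] found = new boolean[1];
        Runnable launch = () -> { };
        Runnable cleanupInGap = () -> found[0] = sketch.cleanup("container_1");
        if (registerFirst) {
            sketch.putThenSubmit("container_1", launch, cleanupInGap);
        } else {
            sketch.submitThenPut("container_1", launch, cleanupInGap);
        }
        sketch.shutdown();
        return found[0];
    }

    public static void main(String[] args) {
        System.out.println("submit then put: cleanup found container = "
                + cleanupFindsContainer(false)); // prints false
        System.out.println("put then submit: cleanup found container = "
                + cleanupFindsContainer(true));  // prints true
    }
}
```

      With the reversed order the cleanup at least sees the entry. Whether that alone is enough in the real code depends on how ContainerLaunch handles a cleanup that arrives before the launch has started, which this sketch does not model.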
      

          People

            Assignee: Unassigned
            Reporter: Miklos Szegedi (miklos.szegedi@cloudera.com)
            Votes: 0
            Watchers: 4
