Hadoop YARN / YARN-5197

RM leaks containers if running container disappears from node update

Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 2.7.2, 2.6.4
    • Fix Version/s: 2.8.0, 2.6.5, 2.7.4
    • Component/s: resourcemanager
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Once a node reports a container as running in a status update, the corresponding RMNodeImpl tracks the container in its launchedContainers map. If the node fails to send the completed container status to the RM and the container simply disappears from subsequent heartbeats, the container leaks in launchedContainers forever and the container completion event is never sent to the scheduler.
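
      The leak described above can be illustrated with a minimal sketch. This is not the attached patch and does not use the real YARN classes: LaunchedContainerTracker, Report, onHeartbeat and the reconcileMissing flag are all hypothetical names chosen for illustration. The sketch assumes one plausible remedy, which is to treat any tracked container that is absent from a heartbeat as finished.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for the per-node bookkeeping; NOT the real RMNodeImpl.
class LaunchedContainerTracker {
    enum State { RUNNING, COMPLETE }

    // One container status entry from a node heartbeat (hypothetical type).
    static final class Report {
        final String containerId;
        final State state;

        Report(String containerId, State state) {
            this.containerId = containerId;
            this.state = state;
        }
    }

    // Containers the node has reported as running and not yet as complete.
    private final Set<String> launchedContainers = new HashSet<>();

    /**
     * Processes one heartbeat and returns the IDs of containers that should be
     * reported to the scheduler as finished.
     *
     * With reconcileMissing == false, only explicit COMPLETE reports remove a
     * container, so a container that silently vanishes stays tracked forever.
     * With reconcileMissing == true (an assumed remedy), tracked containers
     * absent from the heartbeat are also treated as finished.
     */
    List<String> onHeartbeat(List<Report> reports, boolean reconcileMissing) {
        List<String> finished = new ArrayList<>();
        Set<String> seen = new HashSet<>();

        for (Report r : reports) {
            seen.add(r.containerId);
            if (r.state == State.RUNNING) {
                launchedContainers.add(r.containerId);
            } else if (launchedContainers.remove(r.containerId)) {
                // Explicit completion of a container we were tracking.
                finished.add(r.containerId);
            }
        }

        if (reconcileMissing) {
            // Collect first, then remove, to avoid mutating the set mid-iteration.
            List<String> missing = new ArrayList<>();
            for (String id : launchedContainers) {
                if (!seen.contains(id)) {
                    missing.add(id);
                }
            }
            launchedContainers.removeAll(missing);
            finished.addAll(missing);
        }
        return finished;
    }

    int trackedCount() {
        return launchedContainers.size();
    }
}
```

      In this sketch, if a node reports container c1 as RUNNING and then sends an empty heartbeat, calling onHeartbeat with reconcileMissing set to false leaves c1 tracked and returns nothing for the scheduler, which mirrors the reported leak. With reconcileMissing set to true, the same empty heartbeat returns c1 as finished and clears it from the tracked set.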

      Attachments

        1. YARN-5197.001.patch
          14 kB
          Jason Darrell Lowe
        2. YARN-5197.002.patch
          14 kB
          Jason Darrell Lowe
        3. YARN-5197.003.patch
          14 kB
          Jason Darrell Lowe
        4. YARN-5197-branch-2.7.003.patch
          13 kB
          Jason Darrell Lowe
        5. YARN-5197-branch-2.8.003.patch
          14 kB
          Jason Darrell Lowe

            People

              Assignee: Jason Darrell Lowe (jlowe)
              Reporter: Jason Darrell Lowe (jlowe)
              Votes: 0
              Watchers: 11
