  1. Hadoop YARN
  2. YARN-5197

RM leaks containers if running container disappears from node update

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 2.7.2, 2.6.4
    • Fix Version/s: 2.8.0, 2.6.5, 2.7.4
    • Component/s: resourcemanager
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Once a node reports a container as running in a status update, the corresponding RMNodeImpl tracks the container in its launchedContainers map. If the node never sends the completed container status to the RM and the container simply disappears from subsequent heartbeats, the container stays in launchedContainers forever and the container completion event is never sent to the scheduler.
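
      The failure mode described above can be reproduced in a small standalone model. This is only a sketch and not the code from the attached patches: NodeContainerTracker, State, handleHeartbeat and the detectLost flag are hypothetical names invented for illustration, and no Hadoop classes are used. With detectLost set to false, a container that vanishes from the heartbeat stays tracked; with it set to true, any tracked container missing from the report is treated as completed, which is one plausible way to close the leak.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashSet;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for the per-node bookkeeping; not a Hadoop class.
class NodeContainerTracker {
  enum State { RUNNING, COMPLETE }

  // Containers the node has reported as running and not yet completed.
  private final Set<String> launchedContainers = new HashSet<>();

  /**
   * Processes one heartbeat and returns the IDs of containers that should be
   * reported to the scheduler as completed.
   *
   * @param reported   container ID to state, as sent by the node
   * @param detectLost if true, tracked containers absent from the report are
   *                   treated as completed (assumed mitigation)
   */
  List<String> handleHeartbeat(Map<String, State> reported, boolean detectLost) {
    List<String> completed = new ArrayList<>();

    for (Map.Entry<String, State> entry : reported.entrySet()) {
      if (entry.getValue() == State.RUNNING) {
        launchedContainers.add(entry.getKey());
      } else {
        launchedContainers.remove(entry.getKey());
        completed.add(entry.getKey());
      }
    }

    if (detectLost) {
      // A tracked container that the node no longer mentions is considered lost.
      Iterator<String> it = launchedContainers.iterator();
      while (it.hasNext()) {
        String id = it.next();
        if (!reported.containsKey(id)) {
          it.remove();
          completed.add(id);
        }
      }
    }
    return completed;
  }

  // Read-only view of the containers still being tracked.
  Set<String> tracked() {
    return Collections.unmodifiableSet(launchedContainers);
  }
}
```

      Calling handleHeartbeat with an empty report and detectLost set to false after a container was reported as running leaves that container in tracked() and returns no completions, which mirrors the leak. The same call with detectLost set to true removes the container and returns its ID.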

        Attachments

        1. YARN-5197-branch-2.7.003.patch
          13 kB
          Jason Lowe
        2. YARN-5197-branch-2.8.003.patch
          14 kB
          Jason Lowe
        3. YARN-5197.003.patch
          14 kB
          Jason Lowe
        4. YARN-5197.002.patch
          14 kB
          Jason Lowe
        5. YARN-5197.001.patch
          14 kB
          Jason Lowe

              People

              • Assignee:
                jlowe Jason Lowe
              • Reporter:
                jlowe Jason Lowe
              • Votes:
                0
              • Watchers:
                11
