Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-3987

am container complete msg ack to NM once RM receive it

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: resourcemanager
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      In our cluster we set max-am-attempts to a very very large num, and unfortunately our am crash after launched, leaving too many completed container(AM container) in NM. completed container is removed from NM and NMStateStore only if container complete is passed to AM, but if AM couldn't be launched, the completed AM container couldn't be cleaned, and may eat up NM heap memory.

        Attachments

        1. YARN-3987.001.patch
          2 kB
          sandflee
        2. YARN-3987.002.patch
          2 kB
          sandflee

          Activity

            People

            • Assignee:
              sandflee sandflee
              Reporter:
              sandflee sandflee
            • Votes:
              1 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: