Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-5279

Potential Container leak in NM in preemption flow

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Implemented
    • None
    • None
    • None

    Description

      In discussion YARN-4862 comment, it is observed that there could be a container leak in NodeManager whenever container is preempted from RM

      Basically if NM receives same containerId details in containersToCleanUp and containersToBeRemovedFromNM in the same heartbeat then container will never-ever removed in NMContext. Rather NM kills the container of containersToCleanup and send back status again to RM. But RM blindly reject the status since RMContainer is already removed and it is null.

      I think whenever RMContainer is null, RMNode should be informed to send containersToBeRemovedFromNM so that NM will remove from its context.

      Attachments

        1. 0001-YARN-5279.patch
          11 kB
          Rohith Sharma K S

        Activity

          People

            rohithsharma Rohith Sharma K S
            rohithsharma Rohith Sharma K S
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: