Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-4924

NM recovery race can lead to container not cleaned up

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.7.2, 3.0.0-alpha1
    • 2.8.0, 2.7.3, 3.0.0-alpha1
    • nodemanager
    • None
    • Reviewed

    Description

      It's probably a small window but we observed a case where the NM crashed and then a container was not properly cleaned up during recovery.

      I will add details in first comment.

      Attachments

        1. YARN-4924.05.patch
          16 kB
          sandflee
        2. YARN-4924.04.patch
          15 kB
          sandflee
        3. YARN-4924.03.patch
          15 kB
          sandflee
        4. YARN-4924.02.patch
          16 kB
          sandflee
        5. YARN-4924.01.patch
          16 kB
          sandflee

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            sandflee sandflee
            nroberts Nathan Roberts
            Votes:
            0 Vote for this issue
            Watchers:
            10 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment