Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-7187

RMContainerAllocator.ScheduledRequests#getContainerReqToReplace may not find a task when the priority of container is PRIORITY_MAP

Add voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • 2.7.5, 3.1.1, 2.9.2
    • None
    • applicationmaster
    • None

    Description

      The resource manager may has allocated a map container on a host ("h1" for example) for a application, and the container has not been fetched by the MRAppMaster. At this time, the  MRAppMaster receives a task fail event, and the task is on host h1. The event cause the h1 blacklisted. Now the MRAppMaster send a heartbeat, and receive a container on h1. The MRAppMaster can not assign the container since it is on a blacklisted host. The #getContainerReqToReplace fails returning  another task, may cause a map task hang forever.

      Attachments

        1. MAPREDUCE-7187.001.patch
          7 kB
          Zhizhen Hou

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            houzhizhen Zhizhen Hou
            houzhizhen Zhizhen Hou

            Dates

              Created:
              Updated:

              Slack

                Issue deployment