Hadoop Map/Reduce / MAPREDUCE-7187

RMContainerAllocator.ScheduledRequests#getContainerReqToReplace may not find a task when the priority of container is PRIORITY_MAP

    Details

    • Type: Bug
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.7.5, 3.1.1, 2.9.2
    • Fix Version/s: None
    • Component/s: applicationmaster
    • Labels:
      None

      Description

      The ResourceManager may have allocated a map container on a host ("h1", for example) for an application before the MRAppMaster has fetched that container. In the meantime, the MRAppMaster receives a task-failure event for a task that ran on h1, and the event causes h1 to be blacklisted. On its next heartbeat, the MRAppMaster receives the container on h1 but cannot assign it, because the host is now blacklisted. #getContainerReqToReplace then fails to return another task, which may cause a map task to hang forever.
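
The failure mode described above can be illustrated with a small stand-alone model. This is only a sketch and not the Hadoop source or the attached patch: the class `ReplacementSketch`, its fields, and both lookup methods are hypothetical names. The description does not say why the lookup comes up empty, so the sketch assumes, purely for illustration, that the per-host index still holds an entry for a request that is no longer pending. It contrasts a lookup that consults only that index with one that falls back to any pending map request.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashMap;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical, simplified model; not the RMContainerAllocator code.
class ReplacementSketch {
    // Pending map requests, keyed by an illustrative attempt id.
    final Map<String, String> pendingMaps = new LinkedHashMap<>();
    // Index from host name to the attempt ids that asked for that host.
    final Map<String, Deque<String>> hostToAttempts = new HashMap<>();

    void addRequest(String attemptId, String host) {
        pendingMaps.put(attemptId, "request for " + attemptId);
        hostToAttempts.computeIfAbsent(host, h -> new ArrayDeque<>()).add(attemptId);
    }

    // Consults only the host index; returns null if every indexed id is stale.
    String hostOnlyLookup(String host) {
        Deque<String> ids = hostToAttempts.get(host);
        while (ids != null && !ids.isEmpty()) {
            String request = pendingMaps.remove(ids.removeLast());
            if (request != null) {
                return request;
            }
        }
        return null;
    }

    // Same lookup, but falls back to any still-pending map request.
    String lookupWithFallback(String host) {
        String request = hostOnlyLookup(host);
        if (request == null && !pendingMaps.isEmpty()) {
            Iterator<String> it = pendingMaps.values().iterator();
            request = it.next();
            it.remove(); // also removes the entry from pendingMaps
        }
        return request;
    }

    // Assumed scenario: m1 wanted h1 but is no longer pending; m2 still waits.
    static ReplacementSketch scenario() {
        ReplacementSketch s = new ReplacementSketch();
        s.addRequest("m1", "h1");
        s.addRequest("m2", "h2");
        s.pendingMaps.remove("m1"); // host index for h1 is now stale
        return s;
    }

    public static void main(String[] args) {
        System.out.println(scenario().hostOnlyLookup("h1"));
        System.out.println(scenario().lookupWithFallback("h1"));
    }
}
```

In this model the host-only lookup finds nothing for the container that arrived on h1 even though m2 is still waiting, while the fallback variant hands back m2's request. Whether the attached patch takes this approach is not stated in the description.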

        Attachments

        1. MAPREDUCE-7187.001.patch
          7 kB
          Zhizhen Hou

            People

            • Assignee:
              houzhizhen Zhizhen Hou
            • Reporter:
              houzhizhen Zhizhen Hou
            • Votes:
              0
            • Watchers:
              3
