Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-3902

MR AM should reuse containers for map tasks, there-by allowing fine-grained control on num-maps for users without need for CombineFileInputFormat etc.

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: applicationmaster, mrv2
    • Labels:
      None

      Description

      The MR AM is now in a great position to reuse containers across (map) tasks. This is something similar to JVM re-use we had in 0.20.x, but in a significantly better manner:

      1. Consider data-locality when re-using containers
      2. Consider the new shuffle - ensure that reduces fetch output of the whole container at once (i.e. all maps) : MAPREDUCE-4525

        Attachments

        1. AM_ContainerRefactor.pdf
          151 kB
          Siddharth Seth
        2. AMContainerRefactorNotes.pdf
          58 kB
          Siddharth Seth
        3. MAPREDUCE-3902.2.patch
          55 kB
          Tsuyoshi Ozawa
        4. MAPREDUCE-3902.patch
          74 kB
          Arun C Murthy

          Issue Links

            Activity

              People

              • Assignee:
                rkannan82 Kannan Rajah
                Reporter:
                acmurthy Arun C Murthy
              • Votes:
                1 Vote for this issue
                Watchers:
                37 Start watching this issue

                Dates

                • Created:
                  Updated: