Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-3902

MR AM should reuse containers for map tasks, there-by allowing fine-grained control on num-maps for users without need for CombineFileInputFormat etc.

    Details

    • Type: Improvement Improvement
    • Status: Open
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: applicationmaster, mrv2
    • Labels:
      None

      Description

      The MR AM is now in a great position to reuse containers across (map) tasks. This is something similar to JVM re-use we had in 0.20.x, but in a significantly better manner:

      1. Consider data-locality when re-using containers
      2. Consider the new shuffle - ensure that reduces fetch output of the whole container at once (i.e. all maps) : MAPREDUCE-4525
      1. AM_ContainerRefactor.pdf
        151 kB
        Siddharth Seth
      2. AMContainerRefactorNotes.pdf
        58 kB
        Siddharth Seth
      3. MAPREDUCE-3902.2.patch
        55 kB
        Tsuyoshi Ozawa
      4. MAPREDUCE-3902.patch
        74 kB
        Arun C Murthy

        Issue Links

          Activity

          Kannan Rajah made changes -
          Assignee Siddharth Seth [ sseth ] Kannan Rajah [ rkannan82 ]
          Siddharth Seth made changes -
          Attachment AM_ContainerRefactor.pdf [ 12546255 ]
          Attachment AMContainerRefactorNotes.pdf [ 12546256 ]
          Tsuyoshi Ozawa made changes -
          Link This issue blocks MAPREDUCE-4502 [ MAPREDUCE-4502 ]
          Siddharth Seth made changes -
          Link This issue is blocked by YARN-75 [ YARN-75 ]
          Tsuyoshi Ozawa made changes -
          Link This issue relates to MAPREDUCE-4596 [ MAPREDUCE-4596 ]
          Tsuyoshi Ozawa made changes -
          Description The MR AM is now in a great position to reuse containers across (map) tasks. This is something similar to JVM re-use we had in 0.20.x, but in a significantly better manner:
          # Consider data-locality when re-using containers
          # Consider the new shuffle - ensure that reduces fetch output of the whole container at once (i.e. all maps)
          The MR AM is now in a great position to reuse containers across (map) tasks. This is something similar to JVM re-use we had in 0.20.x, but in a significantly better manner:
          # Consider data-locality when re-using containers
          # Consider the new shuffle - ensure that reduces fetch output of the whole container at once (i.e. all maps) : MAPREDUCE-4525
          Tsuyoshi Ozawa made changes -
          Link This issue is related to MAPREDUCE-4525 [ MAPREDUCE-4525 ]
          Tsuyoshi Ozawa made changes -
          Link This issue blocks MAPREDUCE-4502 [ MAPREDUCE-4502 ]
          Tsuyoshi Ozawa made changes -
          Attachment MAPREDUCE-3902.2.patch [ 12539060 ]
          Arun C Murthy made changes -
          Assignee Arun C Murthy [ acmurthy ] Siddharth Seth [ sseth ]
          Arun C Murthy made changes -
          Summary MR AM should reuse containers for map tasks MR AM should reuse containers for map tasks, there-by allowing fine-grained control on num-maps for users without need for CombineFileInputFormat etc.
          Arun C Murthy made changes -
          Field Original Value New Value
          Attachment MAPREDUCE-3902.patch [ 12515757 ]
          Arun C Murthy created issue -

            People

            • Assignee:
              Kannan Rajah
              Reporter:
              Arun C Murthy
            • Votes:
              1 Vote for this issue
              Watchers:
              36 Start watching this issue

              Dates

              • Created:
                Updated:

                Development