Details

    • Reviewed
    • Allow ReduceTask loading a third party plugin for shuffle (and merge) instead of the default shuffle.

    Description

      Support generic shuffle service as set of two plugins: ShuffleProvider & ShuffleConsumer.
      This will satisfy the following needs:

      1. Better shuffle and merge performance. For example: we are working on shuffle plugin that performs shuffle over RDMA in fast networks (10gE, 40gE, or Infiniband) instead of using the current HTTP shuffle. Based on the fast RDMA shuffle, the plugin can also utilize a suitable merge approach during the intermediate merges. Hence, getting much better performance.
      2. Satisfy MAPREDUCE-3060 - generic shuffle service for avoiding hidden dependency of NodeManager with a specific version of mapreduce shuffle (currently targeted to 0.24.0).

      References:

      1. Hadoop Acceleration through Network Levitated Merging, by Prof. Weikuan Yu from Auburn University with others, http://pasl.eng.auburn.edu/pubs/sc11-netlev.pdf
      2. I am attaching 2 documents with suggested Top Level Design for both plugins (currently, based on 1.0 branch)
      3. I am providing link for downloading UDA - Mellanox's open source plugin that implements generic shuffle service using RDMA and levitated merge. Note: At this phase, the code is in C++ through JNI and you should consider it as beta only. Still, it can serve anyone that wants to implement or contribute to levitated merge. (Please be advised that levitated merge is mostly suit in very fast networks) - http://www.mellanox.com/content/pages.php?pg=products_dyn&product_family=144&menu_section=69

      Attachments

        1. HADOOP-1.x.y.patch
          20 kB
          Avner BenHanoch
        2. Hadoop Shuffle Plugin Design.rtf
          78 kB
          Avner BenHanoch
        3. mapreduce-4049.patch
          25 kB
          Avner BenHanoch
        4. MAPREDUCE-4049--branch-1.patch
          56 kB
          Avner BenHanoch
        5. MAPREDUCE-4049--branch-1.patch
          54 kB
          Avner BenHanoch
        6. MAPREDUCE-4049--branch-1.patch
          27 kB
          Avner BenHanoch

        Issue Links

          Activity

            People

              avnerb Avner BenHanoch
              avnerb Avner BenHanoch
              Votes:
              9 Vote for this issue
              Watchers:
              50 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: