Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-1362

Pipes should be ported to the new mapreduce API

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • None
    • None
    • pipes
    • None
    • Update Hadoop Pipes to support MRv2 API

    Description

      Pipes is still currently using the old mapred API. This prevents us from using pipes with HBase's TableInputFormat, HRegionPartitioner, etc.

      Here is a rough proposal for how to accomplish this:

      • Add a new package org.apache.hadoop.mapreduce.pipes that uses the new mapred API.
      • the new pipes package will run side by side with the old one. old one should get deprecated at some point.
      • the wire protocol used between PipesMapper and PipesReducer and C++ programs must not change.
      • bin/hadoop should support both pipes (old api) and pipes2 (new api)

      Does this sound reasonable?

      Attachments

        1. MAPREDUCE-1362.patch
          123 kB
          Joe Mudd
        2. MAPREDUCE-1362.patch
          68 kB
          Bassam Tabbara
        3. MAPREDUCE-1362-trunk.patch
          73 kB
          Bassam Tabbara

        Activity

          People

            Unassigned Unassigned
            bassam Bassam Tabbara
            Votes:
            1 Vote for this issue
            Watchers:
            9 Start watching this issue

            Dates

              Created:
              Updated: