Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-605

In Streaming, allow different mappers for different subsets of the input

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • contrib/streaming
    • None

    Description

      The command line may look like this:

      -mapper mapper-command-1 -input dir11 -mapper mapper-command-2 -input dir22 input -dir21

      meaning that map phase will apply mapper-command-1 to part files from dir11, and the part files from dir22 and dir21 will be processed by mapper-command-2
      then all will be shuffled and processed by a single reducer.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              arkady arkady borkovsky
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated: