Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-2410

document multiple keys per reducer oddity in hadoop streaming FAQ

    Details

    • Hadoop Flags:
      Reviewed
    • Release Note:
      Add an FAQ entry regarding the differences between Java API and Streaming development of MR programs.
    • Tags:
      streaming

      Description

      Hi,
      for a newcomer to hadoop streaming, it comes as a surprise that the reducer receives arbitrary keys, unlike the "real" hadoop where a reducer works on a single key.
      An explanation for this is @ http://mail-archives.apache.org/mod_mbox/hadoop-common-user/201103.mbox/browser

      I suggest to add this to the FAQ of hadoop streaming

        Attachments

        1. MAPREDUCE-2410.r1.diff
          1.0 kB
          Harsh J
        2. MAPREDUCE-2410.r2.diff
          1 kB
          Harsh J
        3. MAPREDUCE-2410.r3.diff
          1 kB
          Harsh J

          Activity

            People

            • Assignee:
              qwertymaniac Harsh J
              Reporter:
              dieter_be Dieter Plaetinck
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 40m
                40m
                Remaining:
                Remaining Estimate - 40m
                40m
                Logged:
                Time Spent - Not Specified
                Not Specified