Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-2410

document multiple keys per reducer oddity in hadoop streaming FAQ

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Reviewed
    • Add an FAQ entry regarding the differences between Java API and Streaming development of MR programs.
    • streaming

    Description

      Hi,
      for a newcomer to hadoop streaming, it comes as a surprise that the reducer receives arbitrary keys, unlike the "real" hadoop where a reducer works on a single key.
      An explanation for this is @ http://mail-archives.apache.org/mod_mbox/hadoop-common-user/201103.mbox/browser

      I suggest to add this to the FAQ of hadoop streaming

      Attachments

        1. MAPREDUCE-2410.r1.diff
          1.0 kB
          Harsh J
        2. MAPREDUCE-2410.r2.diff
          1 kB
          Harsh J
        3. MAPREDUCE-2410.r3.diff
          1 kB
          Harsh J

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            qwertymaniac Harsh J
            dieter_be Dieter Plaetinck
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - 40m
                40m
                Remaining:
                Remaining Estimate - 40m
                40m
                Logged:
                Time Spent - Not Specified
                Not Specified

                Issue deployment