  1. Hadoop Map/Reduce
  2. MAPREDUCE-5457

Add a KeyOnlyTextOutputReader to enable streaming to write out text files without separators

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.3.0, 2.3.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      MR jobs sometimes want to output plain lines of text rather than key/value pairs. TextOutputFormat supports this: when it is given a null value, it writes only the key, with no separator. Streaming jobs cannot take advantage of this, because they have no way to output null values. A text output reader that takes each line as the key and emits a NullWritable as the value would allow streaming jobs to output lines of text.
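
      The behavior described above can be illustrated with a small, Hadoop-free sketch. It is not the code from the attached patches: the class and method names (KeyOnlyOutputSketch, readKeyValue, readKeyOnly, formatRecord) are hypothetical, plain strings stand in for Text, and a Java null stands in for NullWritable. It also assumes that a key/value reader produces an empty, non-null value when a line contains no separator.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Map;

public class KeyOnlyOutputSketch {

    // Hypothetical stand-in for a key/value text reader: splits on the first tab.
    // Assumption: when the line has no tab, the value is an empty string, never null.
    static Map.Entry<String, String> readKeyValue(String line) {
        int tab = line.indexOf('\t');
        if (tab < 0) {
            return new SimpleEntry<>(line, "");
        }
        return new SimpleEntry<>(line.substring(0, tab), line.substring(tab + 1));
    }

    // Hypothetical stand-in for the proposed key-only reader: the whole line
    // becomes the key, and the value is null (playing the role of NullWritable).
    static Map.Entry<String, String> readKeyOnly(String line) {
        return new SimpleEntry<>(line, null);
    }

    // Mimics the rule from the description: a null value means the key is
    // written alone, with no separator; otherwise key, separator, value.
    static String formatRecord(Map.Entry<String, String> record, String separator) {
        if (record.getValue() == null) {
            return record.getKey();
        }
        return record.getKey() + separator + record.getValue();
    }

    public static void main(String[] args) {
        String line = "just a line of text";
        // Brackets make a trailing separator visible in the printed output.
        System.out.println("[" + formatRecord(readKeyValue(line), "\t") + "]");
        System.out.println("[" + formatRecord(readKeyOnly(line), "\t") + "]");
    }
}
```

      Under these assumptions, the key/value path leaves a trailing separator after the line, while the key-only path writes the line unchanged, which is the effect the issue asks for.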

        Attachments

        1. MAPREDUCE-5457.patch
          6 kB
          Sandy Ryza
        2. MAPREDUCE-5457-1.patch
          13 kB
          Sandy Ryza
        3. MAPREDUCE-5457-2.patch
          12 kB
          Sandy Ryza
        4. MAPREDUCE-5457-3.patch
          12 kB
          Sandy Ryza
        5. MAPREDUCE-5457-branch-1.patch
          6 kB
          Sandy Ryza
        6. MAPREDUCE-5457-branch-1-1.patch
          12 kB
          Sandy Ryza
        7. MAPREDUCE-5457-branch-1-2.patch
          12 kB
          Sandy Ryza

            People

            • Assignee:
              sandyr Sandy Ryza
              Reporter:
              sandyr Sandy Ryza
            • Votes:
              0
              Watchers:
              6