Hadoop Map/Reduce / MAPREDUCE-4616

Improvement to MultipleOutputs javadocs


    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.0.3
    • Fix Version/s: 2.0.3-alpha
    • Component/s: documentation
    • Labels:
      None
    • Target Version/s:
    • Tags:
      multipleoutputs, multipletestoutputformat, new api, lazyoutputformat

      Description

      In the new API, MultipleOutputs can segment output into directories. Call MultipleOutputs.write(KEYOUT key, VALUEOUT value, String baseOutputPath) in the Reducer to determine the output directory, and use LazyOutputFormat in the job-level config to suppress the normal (otherwise empty) output files, e.g. use LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class); instead of job.setOutputFormatClass(TextOutputFormat.class);

      This recreates the functionality that the old API provided through MultipleTextOutputFormat (etc.).
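
      The pattern described above could look roughly like the following. This is a minimal sketch, not the text of the attached patch: the class names SegmentedOutputSketch, FirstFieldMapper and SegmentingReducer, the helper basePathFor, the tab-separated input format and the "<key>/part" naming scheme are all assumptions made for illustration. It also assumes a Hadoop 2.x classpath (Job.getInstance) and keys that are safe to use as directory names.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.LazyOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.MultipleOutputs;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;

public class SegmentedOutputSketch {

    // Hypothetical naming scheme: one directory per key, files named part-r-nnnnn inside it.
    static String basePathFor(String key) {
        return key + "/part";
    }

    // Assumed input: lines of the form "<key>\t<value>"; other lines are skipped.
    public static class FirstFieldMapper extends Mapper<LongWritable, Text, Text, Text> {
        @Override
        protected void map(LongWritable offset, Text line, Context context)
                throws IOException, InterruptedException {
            String[] fields = line.toString().split("\t", 2);
            if (fields.length == 2) {
                context.write(new Text(fields[0]), new Text(fields[1]));
            }
        }
    }

    public static class SegmentingReducer extends Reducer<Text, Text, Text, Text> {
        private MultipleOutputs<Text, Text> out;

        @Override
        protected void setup(Context context) {
            out = new MultipleOutputs<Text, Text>(context);
        }

        @Override
        protected void reduce(Text key, Iterable<Text> values, Context context)
                throws IOException, InterruptedException {
            // baseOutputPath is resolved relative to the job output directory.
            String basePath = basePathFor(key.toString());
            for (Text value : values) {
                out.write(key, value, basePath);
            }
        }

        @Override
        protected void cleanup(Context context) throws IOException, InterruptedException {
            // Closing MultipleOutputs closes the record writers it opened.
            out.close();
        }
    }

    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "segmented output sketch");
        job.setJarByClass(SegmentedOutputSketch.class);
        job.setMapperClass(FirstFieldMapper.class);
        job.setReducerClass(SegmentingReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);

        // Instead of job.setOutputFormatClass(TextOutputFormat.class): the default
        // part-r-nnnnn files are only created if something is written to them.
        LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

      With this sketch, records for a key such as "fruit" would be expected to land in files under <output dir>/fruit/, and because the reducer never calls context.write, no default output files should be created at the top level of the output directory.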

        Attachments

        1. MAPREDUCE-4616.patch
          5 kB
          Tony Burton
        2. MAPREDUCE-4616.patch
          4 kB
          Tony Burton

            People

            • Assignee:
              tonyb Tony Burton
            • Reporter:
              tonyb Tony Burton
