Hadoop Map/Reduce / MAPREDUCE-4616

Improvement to MultipleOutputs javadocs

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.0.3
    • Fix Version/s: 2.0.3-alpha
    • Component/s: documentation
    • None
    • multipleoutputs, multipletestoutputformat, new api, lazyoutputformat

    Description

      In the new API, MultipleOutputs can segment output into directories. Call MultipleOutputs.write(KEYOUT key, VALUEOUT value, String baseOutputPath) in the Reducer to determine the output directory, and use LazyOutputFormat in the job-level config to suppress the normal output, which would otherwise be created as empty default files. For example, use LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class); instead of job.setOutputFormatClass(TextOutputFormat.class);

      This recreates the functionality previously provided in the old API by MultipleTextOutputFormat (etc). The MultipleOutputs javadocs should describe this usage.
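
      The baseOutputPath argument described above is an ordinary string, so the directory layout comes down to how the Reducer builds that string from each key. The following is a minimal sketch of one way to do that, not the code from the attached patch. The class BaseOutputPaths, the method forKey, the "Surname|Forename" key format and the "unknown" fallback directory are all hypothetical names chosen for illustration. The Hadoop calls appear only in comments so that the sketch compiles without Hadoop on the classpath.

```java
// Hypothetical helper (not part of Hadoop): builds the baseOutputPath string
// that a Reducer would pass to MultipleOutputs.write(key, value, baseOutputPath).
final class BaseOutputPaths {
    private BaseOutputPaths() {}

    // Assumes keys of the form "Surname|Forename"; anything else goes to a
    // fallback directory so that malformed keys are still written somewhere.
    static String forKey(String key) {
        String[] parts = key.split("\\|");
        if (parts.length != 2 || parts[0].isEmpty() || parts[1].isEmpty()) {
            return "unknown/part";
        }
        // The path is relative to the job output directory. The last segment
        // is the file name prefix, the segments before it are directories.
        return parts[0] + "/" + parts[1] + "/part";
    }

    public static void main(String[] args) {
        System.out.println(forKey("Smith|John"));
    }
}

// Intended use inside a new-API Reducer (shown as comments only):
//
//   mos = new MultipleOutputs<Text, Text>(context);                  // in setup()
//   mos.write(key, value, BaseOutputPaths.forKey(key.toString()));   // in reduce()
//   mos.close();                                                     // in cleanup()
//
// and in the job driver:
//
//   LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class);
```

      With this layout, records for the key "Smith|John" should end up under Smith/John/ inside the job output directory, in files whose names start with "part". Forgetting mos.close() in cleanup() is a common cause of missing or truncated output.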

      Attachments

        1. MAPREDUCE-4616.patch
          5 kB
          Tony Burton
        2. MAPREDUCE-4616.patch
          4 kB
          Tony Burton

          People

            Assignee: Tony Burton (tonyb)
            Reporter: Tony Burton (tonyb)