Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.0.3
    • Fix Version/s: 2.0.3-alpha
    • Component/s: documentation
    • Labels:
      None
    • Target Version/s:
    • Tags:
      multipleoutputs, multipletestoutputformat, new api, lazyoutputformat

      Description

      In the new API, and using MultipleOutputs it is possible to segment output into directories by using MultipleOutputs.write(KEYOUT key, VALUEOUT value, String baseOutputPath) in the Reducer to determine the output directory, and by using LazyOutputFormat at the job-level config to suppress normal output [eg use LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class); instead of job.setOutputFormatClass(TextOutputFormat.class);]

      This recreates the functionality previously provided in the old API by using MultipleTextOutputFormat (etc)

        Attachments

        1. MAPREDUCE-4616.patch
          5 kB
          Tony Burton
        2. MAPREDUCE-4616.patch
          4 kB
          Tony Burton

          Activity

            People

            • Assignee:
              tonyb Tony Burton
              Reporter:
              tonyb Tony Burton
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: