Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-1558

changes to OutputFormat to work on temporary directory to enable re-running crashed jobs (Issue: 1121)

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • None
    • None
    • all

    Description

      Add OutputFormat methods like:

      /** Called to initialize output for this job. */
      void initialize(JobConf job) throws IOException;

      /** Called to finalize output for this job. */
      void commit(JobConf job) throws IOException;

      In the base implemenation for FileSystem output, initialize() might then create a temporary directory for the job, removing any that already exists, and commit could rename the temporary output directory to the final name.

      The existing checkOutputSpecs() would continue to throw an exception if the final output already exists.

      Attachments

        1. hadoop-1558-JUL2607-1600.txt
          22 kB
          Alejandro Abdelnur

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tucu00 Alejandro Abdelnur
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: