Hadoop Common / HADOOP-1558

changes to OutputFormat to work on temporary directory to enable re-running crashed jobs (Issue: 1121)


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Won't Fix
    • Environment: all

    Description

      Add OutputFormat methods like:

      /** Called to initialize output for this job. */
      void initialize(JobConf job) throws IOException;

      /** Called to finalize output for this job. */
      void commit(JobConf job) throws IOException;

      In the base implementation for FileSystem output, initialize() might then create a temporary directory for the job, removing any that already exists, and commit() could rename the temporary output directory to the final name.

      The existing checkOutputSpecs() would continue to throw an exception if the final output already exists.
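
The proposal above can be illustrated with a minimal, self-contained sketch. It is not the attached patch and does not use the Hadoop API: it uses java.nio.file on a local file system in place of FileSystem and JobConf, and the class name TempDirOutputSketch and the "_temporary_" prefix are assumptions made only for this example.

```java
import java.io.IOException;
import java.nio.file.FileAlreadyExistsException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical illustration of the initialize()/commit() idea on a local file system.
class TempDirOutputSketch {
    private final Path finalDir;
    private final Path tempDir;

    TempDirOutputSketch(Path finalDir) {
        this.finalDir = finalDir;
        // Assumed naming scheme: a sibling directory, so the final rename stays on one file system.
        this.tempDir = finalDir.resolveSibling("_temporary_" + finalDir.getFileName());
    }

    Path tempDir() {
        return tempDir;
    }

    // Mirrors the existing check: fail if the final output already exists.
    void checkOutputSpecs() throws IOException {
        if (Files.exists(finalDir)) {
            throw new FileAlreadyExistsException(finalDir.toString());
        }
    }

    // Create a fresh temporary directory, removing leftovers from a crashed run.
    void initialize() throws IOException {
        deleteRecursively(tempDir);
        Files.createDirectories(tempDir);
    }

    // Publish the output by renaming the temporary directory to the final name.
    void commit() throws IOException {
        Files.move(tempDir, finalDir);
    }

    private static void deleteRecursively(Path dir) throws IOException {
        if (!Files.exists(dir)) {
            return;
        }
        List<Path> paths;
        try (Stream<Path> walk = Files.walk(dir)) {
            // Reverse order lists children before their parent directory.
            paths = walk.sorted(Comparator.reverseOrder()).collect(Collectors.toList());
        }
        for (Path p : paths) {
            Files.delete(p);
        }
    }

    public static void main(String[] args) throws IOException {
        Path out = Files.createTempDirectory("job").resolve("output");
        TempDirOutputSketch format = new TempDirOutputSketch(out);
        format.checkOutputSpecs();
        format.initialize();
        Files.write(format.tempDir().resolve("part-00000"), "data".getBytes());
        format.commit();
        System.out.println("committed: " + Files.exists(out.resolve("part-00000")));
    }
}
```

Because a crashed job never reaches commit(), the final directory is never created, so a re-run passes checkOutputSpecs() and initialize() discards the partial temporary output.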

      Attachments

        1. hadoop-1558-JUL2607-1600.txt
          22 kB
          Alejandro Abdelnur


          People

            Assignee: Unassigned
            Reporter: Alejandro Abdelnur (tucu00)
            Votes: 0
            Watchers: 1

