Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-7229

Job History Server should ignore continuing move done_intermediate files when rename failed

Add voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.1.2
    • None
    • jobhistoryserver
    • None

    Description

      private void moveToDoneNow(final Path src, final Path target)
          throws IOException {
        LOG.info("Moving " + src.toString() + " to " + target.toString());
        try {
          intermediateDoneDirFc.rename(src, target, Options.Rename.NONE);
        } catch (FileNotFoundException e) {
          if (doneDirFc.util().exists(target)) {
            LOG.info("Source file " + src.toString() + " not found, but target "
                + "file " + target.toString() + " already exists. Move already "
                + "happened.");
          } else {
            throw e;
          }
        }
      }
      

      if files in done_intermediate dir were already deleted(deleted manually), not move to done dir. Job history will infinitely rename these files later, and throw exp always.This will cause newly created files in done_intermediate dir can not be moved. Later, when scan done_intermediate dir, FileContext.listStatus(done_intermediate) will so slow, and gc problem

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            Cedar Morty Zhong

            Dates

              Created:
              Updated:

              Slack

                Issue deployment