Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-11709

StandbyCheckpointer should handle an non-existing legacyOivImageDir gracefully

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

      Description

      In StandbyCheckpointer, if the legacy OIV directory is not properly created, or was deleted for some reason (e.g. mis-operation), all checkpoint ops will fall. Not only the ANN won't receive new fsimages, the JNs will get full with edit log files, and cause NN to crash.

            // Save the legacy OIV image, if the output dir is defined.
            String outputDir = checkpointConf.getLegacyOivImageDir();
            if (outputDir != null && !outputDir.isEmpty()) {
              img.saveLegacyOIVImage(namesystem, outputDir, canceler);
            }
      

      It doesn't make sense to let such an unimportant part (saving OIV) abort all checkpoints and cause NN crash (and possibly lose data).

        Attachments

        1. HDFS-11709.000.patch
          1 kB
          Erik Krogen

        Issue Links

          Activity

            People

            • Assignee:
              xkrogen Erik Krogen
              Reporter:
              zhz Zhe Zhang

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment