Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-11709

StandbyCheckpointer should handle an non-existing legacyOivImageDir gracefully

    XMLWordPrintableJSON

Details

    Description

      In StandbyCheckpointer, if the legacy OIV directory is not properly created, or was deleted for some reason (e.g. mis-operation), all checkpoint ops will fall. Not only the ANN won't receive new fsimages, the JNs will get full with edit log files, and cause NN to crash.

            // Save the legacy OIV image, if the output dir is defined.
            String outputDir = checkpointConf.getLegacyOivImageDir();
            if (outputDir != null && !outputDir.isEmpty()) {
              img.saveLegacyOIVImage(namesystem, outputDir, canceler);
            }
      

      It doesn't make sense to let such an unimportant part (saving OIV) abort all checkpoints and cause NN crash (and possibly lose data).

      Attachments

        1. HDFS-11709.000.patch
          1 kB
          Erik Krogen

        Issue Links

          Activity

            People

              xkrogen Erik Krogen
              zhz Zhe Zhang
              Votes:
              1 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: