Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1073 Simpler model for Namenode's fs Image and edit Logs
  3. HDFS-2026

1073: 2NN needs to handle case of reformatted NN better

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • Edit log branch (HDFS-1073)
    • namenode
    • None
    • Reviewed

    Description

      Currently in the 1073 branch, the following steps ends up with a very confused 2NN:

      • format NN, run NN
      • start 2NN, perform some checkpoints
      • reformat NN, start NN on new namespace
      • restart same 2NN

      The 2NN currently saves the new VERSION info into its local storage directory but doesn't clear out the old checkpoint or edits files. This is obviously wrong and might lead to a corrupt checkpoint getting uploaded.

      If the 2NN has storage directories with VERSION info, and connects to an NN with different VERSION info, there are two alternatives:
      a) refuse to perform any checkpoints until the operator issues a "secondarynamenode -format" command (this is similar to how the backupnode/checkpointnode works)
      b) clear the current contents of the storage directory and save the new NN's VERSION info.

      Attachments

        1. hdfs-2026.txt
          25 kB
          Todd Lipcon
        2. hdfs-2026.txt
          26 kB
          Todd Lipcon

        Issue Links

          Activity

            People

              tlipcon Todd Lipcon
              tlipcon Todd Lipcon
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: