Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-3597

SNN can fail to start on upgrade

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 2.0.0-alpha
    • 0.23.3, 2.0.2-alpha
    • namenode
    • Reviewed

    Description

      When upgrading from 1.x to 2.0.0, the SecondaryNameNode can fail to start up:

      2012-06-16 09:52:33,812 ERROR org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode: Exception in doCheckpoint
      java.io.IOException: Inconsistent checkpoint fields.
      LV = -40 namespaceID = 64415959 cTime = 1339813974990 ; clusterId = CID-07a82b97-8d04-4fdd-b3a1-f40650163245 ; blockpoolId = BP-1792677198-172.29.121.67-1339813967723.
      Expecting respectively: -19; 64415959; 0; ; .
      at org.apache.hadoop.hdfs.server.namenode.CheckpointSignature.validateStorageInfo(CheckpointSignature.java:120)
      at org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.doCheckpoint(SecondaryNameNode.java:454)
      at org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.doWork(SecondaryNameNode.java:334)
      at org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode$2.run(SecondaryNameNode.java:301)
      at org.apache.hadoop.security.SecurityUtil.doAsLoginUserOrFatal(SecurityUtil.java:438)
      at org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode.run(SecondaryNameNode.java:297)
      at java.lang.Thread.run(Thread.java:662)
      

      The error check we're hitting came from HDFS-1073, and it's intended to verify that we're connecting to the correct NN. But the check is too strict and considers "different metadata version" to be the same as "different clusterID".

      I believe the check in doCheckpoint simply needs to explicitly check for and handle the update case.

      Attachments

        1. hdfs-3597.txt
          8 kB
          Andy Isaacson
        2. hdfs-3597-2.txt
          7 kB
          Andy Isaacson
        3. hdfs-3597-3.txt
          8 kB
          Andy Isaacson
        4. hdfs-3597-4.txt
          8 kB
          Andy Isaacson

        Activity

          People

            adi2 Andy Isaacson
            adi2 Andy Isaacson
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: