Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-5535 Umbrella jira for improved HDFS rolling upgrades
  3. HDFS-6029

Secondary NN fails to checkpoint after -rollingUpgrade prepare

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • HDFS-5535 (Rolling upgrades)
    • None
    • datanode, ha, hdfs-client, namenode
    • None

    Description

      If Secondary NN crashes right after calling dfsadmin -rollingUpgrade prepare, it fails to checkpoint because it fails to download the newest fsimage whose name is fsimage_rollback_txid.

      To reproduce this bug:

      1. Run NN and SNN.
      2. Run dfsadmin -rollingUpgrade prepare.
      3. Shutdown SNN, clear all fsimage created by SNN.
      4. Restart SNN, SNN tries to download the fsimage_txid instead of fsimage_rollback_txid, resulting the failure.

      Attachments

        1. HDFS-6029.000.patch
          11 kB
          Jing Zhao

        Activity

          People

            jingzhao Jing Zhao
            wheat9 Haohui Mai
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: