Hadoop HDFS / HDFS-14528

Failover from Active to Standby Failed

Details

    • Type: Bug
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: ha

    Description

      In a cluster with more than one Standby NameNode, manual failover throws an exception in some cases.

      When executing the failover command from the Active NameNode to a Standby NameNode:

        ./hdfs haadmin -failover nn1 nn2

      the following exception is thrown:

        Operation failed: Call From X-X-X-X/X-X-X-X to Y-Y-Y-Y:nnnn failed on connection exception: java.net.ConnectException: Connection refused

      This occurs in the following cases:

      Scenario 1:

      NameNodes - NN1 (Active), NN2 (Standby), NN3 (Standby)

      A manual failover from NN1 to NN2 throws the exception if NN3 is down.

      Scenario 2:

      NameNodes - NN1 (Active), NN2 (Standby), NN3 (Standby)

      ZKFCs     - ZKFC1,        ZKFC2,         ZKFC3

      A manual failover from NN1 to NN3 throws the exception if NN3's ZKFC (ZKFC3) is down.
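
      The report does not state the root cause. One reading consistent with Scenario 1 is that the failover path contacts every other NameNode and aborts on the first connection failure, even when the unreachable node is not the failover target. The toy model below only illustrates that difference; it is not Hadoop code, and every name in it (FailoverSketch, ping, failoverStrict, failoverTolerant, the reachable map) is hypothetical. It takes no position on what should happen in Scenario 2, where the target's own ZKFC is down.

```java
import java.net.ConnectException;
import java.util.LinkedHashMap;
import java.util.Map;

/** Toy model of the reported symptom. Nothing here is taken from Hadoop. */
public class FailoverSketch {

    /** Hypothetical stand-in for an RPC: throws if the endpoint is marked unreachable. */
    public static void ping(String endpoint, Map<String, Boolean> reachable)
            throws ConnectException {
        if (!reachable.getOrDefault(endpoint, false)) {
            throw new ConnectException("Connection refused: " + endpoint);
        }
    }

    /** Strict variant: any unreachable peer aborts the whole failover. */
    public static boolean failoverStrict(Map<String, Boolean> reachable) {
        try {
            for (String endpoint : reachable.keySet()) {
                ping(endpoint, reachable);
            }
            return true;
        } catch (ConnectException e) {
            System.out.println("Operation failed: " + e.getMessage());
            return false;
        }
    }

    /** Tolerant variant: only the failover target must be reachable. */
    public static boolean failoverTolerant(String target, Map<String, Boolean> reachable) {
        for (String endpoint : reachable.keySet()) {
            try {
                ping(endpoint, reachable);
            } catch (ConnectException e) {
                if (endpoint.equals(target)) {
                    System.out.println("Operation failed: " + e.getMessage());
                    return false;
                }
                // Assumption: a peer that is not the target can be skipped.
                System.out.println("Skipping unreachable peer: " + endpoint);
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Scenario 1 as seen from NN1: NN2 is up, NN3 is down, target is NN2.
        Map<String, Boolean> scenario1 = new LinkedHashMap<>();
        scenario1.put("nn2", true);
        scenario1.put("nn3", false);

        System.out.println("strict:   " + failoverStrict(scenario1));
        System.out.println("tolerant: " + failoverTolerant("nn2", scenario1));
    }
}
```

      In this model the strict variant fails in Scenario 1 because of NN3, while the tolerant variant completes because the target NN2 is reachable. Whether the attached patches take this approach is not stated in the description.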

      Attachments

        1. HDFS-14528.007.patch
          10 kB
          Ravuri Sushma sree
        2. HDFS-14528.006.patch
          10 kB
          Ravuri Sushma sree
        3. HDFS-14528.005.patch
          8 kB
          Ravuri Sushma sree
        4. HDFS-14528.004.patch
          8 kB
          Ravuri Sushma sree
        5. HDFS-14528.003.patch
          8 kB
          Ravuri Sushma sree
        6. HDFS-14528.2.Patch
          7 kB
          Ravuri Sushma sree
        7. ZKFC_issue.patch
          0.9 kB
          Ravuri Sushma sree

          People

            Assignee: Sushma_28 Ravuri Sushma sree
            Reporter: Sushma_28 Ravuri Sushma sree
            Votes: 0
            Watchers: 8
