Hadoop HDFS / HDFS-4591

HA clients can fail to fail over while Standby NN is performing long checkpoint

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.4-alpha
    • Fix Version/s: 2.1.0-beta
    • Component/s: ha, namenode
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Clients know to fail over to the Active NN when an RPC to the Standby NN throws a StandbyException. However, most places in the code that check whether the NN is in the standby state do so while holding the FSNamesystem (FSNS) fsLock. Because this lock is held for the duration of saveNamespace during a checkpoint, those checks block and no StandbyException is thrown during that time, so clients cannot fail over until the checkpoint finishes.
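
The blocking behavior described above can be reproduced outside Hadoop with a small, self-contained sketch. Everything in it is hypothetical and for illustration only: the class `StandbyCheckSketch`, its nested `StandbyException`, and the methods `readCheckInsideLock` and `readCheckBeforeLock` are stand-ins, not the actual FSNamesystem code or the attached patch. It assumes the checkpoint holds the lock exclusively, and it shows one possible mitigation (checking the state before acquiring the lock, then re-checking under it).

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.concurrent.locks.ReentrantReadWriteLock;

public class StandbyCheckSketch {
    /** Hypothetical stand-in for the exception that tells a client to fail over. */
    static class StandbyException extends Exception {
        StandbyException(String msg) { super(msg); }
    }

    // Stand-in for the namesystem lock; assumed to be held exclusively during a checkpoint.
    private final ReentrantReadWriteLock fsLock = new ReentrantReadWriteLock();
    private volatile boolean standby = true;

    private void checkNotStandby() throws StandbyException {
        if (standby) {
            throw new StandbyException("Operation not supported in standby state");
        }
    }

    /** Pattern from the report: the state check runs only after the lock is acquired. */
    String readCheckInsideLock() throws StandbyException {
        fsLock.readLock().lock();
        try {
            checkNotStandby();
            return "ok";
        } finally {
            fsLock.readLock().unlock();
        }
    }

    /** Possible mitigation: check before taking the lock, then re-check under it. */
    String readCheckBeforeLock() throws StandbyException {
        checkNotStandby(); // unlocked check, cannot block behind a checkpoint
        fsLock.readLock().lock();
        try {
            checkNotStandby(); // state may have changed while waiting for the lock
            return "ok";
        } finally {
            fsLock.readLock().unlock();
        }
    }

    /**
     * Issues one call while another thread holds the write lock, as a long
     * checkpoint would. Returns true if the call failed with StandbyException
     * within timeoutMs, false if it was still blocked (or returned normally).
     */
    static boolean rejectsPromptlyDuringCheckpoint(boolean checkBeforeLock, long timeoutMs)
            throws InterruptedException {
        StandbyCheckSketch nn = new StandbyCheckSketch();
        CountDownLatch lockHeld = new CountDownLatch(1);
        CountDownLatch finishCheckpoint = new CountDownLatch(1);
        ExecutorService pool = Executors.newFixedThreadPool(2);
        try {
            // Simulated checkpoint: hold the write lock until told to finish.
            pool.submit(() -> {
                nn.fsLock.writeLock().lock();
                try {
                    lockHeld.countDown();
                    finishCheckpoint.await();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    nn.fsLock.writeLock().unlock();
                }
            });
            lockHeld.await();

            // Simulated client RPC arriving at the standby during the checkpoint.
            Future<String> rpc = pool.submit(
                () -> checkBeforeLock ? nn.readCheckBeforeLock() : nn.readCheckInsideLock());
            try {
                rpc.get(timeoutMs, TimeUnit.MILLISECONDS);
                return false;
            } catch (ExecutionException e) {
                return e.getCause() instanceof StandbyException;
            } catch (TimeoutException e) {
                return false; // still waiting on the lock, so the client gets no signal
            }
        } finally {
            finishCheckpoint.countDown();
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("check inside lock rejects promptly: "
            + rejectsPromptlyDuringCheckpoint(false, 200));
        System.out.println("check before lock rejects promptly: "
            + rejectsPromptlyDuringCheckpoint(true, 200));
    }
}
```

In this sketch the first call times out because it is parked behind the simulated checkpoint, while the second is rejected immediately. The re-check under the lock is kept so that a state transition that happens while the caller waits for the lock is still observed.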

      Attachments

        1. HDFS-4591.patch
          31 kB
          Aaron Myers
        2. HDFS-4591.patch
          28 kB
          Aaron Myers
        3. HDFS-4591.patch
          27 kB
          Aaron Myers

          People

            Assignee: Aaron Myers (atm)
            Reporter: Aaron Myers (atm)
            Votes: 0
            Watchers: 8