Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-4591

HA clients can fail to fail over while Standby NN is performing long checkpoint

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.4-alpha
    • Fix Version/s: 2.1.0-beta
    • Component/s: ha, namenode
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      Clients know to fail over to talk to the Active NN when they perform an RPC to the Standby NN and it throws a StandbyException. However, most places in the code that check if the NN is in the standby state do so inside the FSNS fsLock. Since this lock is held for the duration of the saveNamespace during a checkpoint, StandbyExceptions will not be thrown during this time.

        Attachments

        1. HDFS-4591.patch
          31 kB
          Aaron Myers
        2. HDFS-4591.patch
          28 kB
          Aaron Myers
        3. HDFS-4591.patch
          27 kB
          Aaron Myers

          Activity

            People

            • Assignee:
              atm Aaron Myers
              Reporter:
              atm Aaron Myers
            • Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: