Hadoop HDFS
  1. Hadoop HDFS
  2. HDFS-3071

haadmin failover command does not provide enough detail for when target NN is not ready to be active

    Details

    • Type: Improvement Improvement
    • Status: Resolved
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.24.0
    • Fix Version/s: 2.0.0-alpha
    • Component/s: ha
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      When running the failover command, you can get an error message like the following:

      $ hdfs --config $(pwd) haadmin -failover namenode2 namenode1
      Failover failed: xxx.yyy/1.2.3.4:8020 is not ready to become active

      Unfortunately, the error message doesn't describe why that node isn't ready to be active. In my case, the target namenode's logs don't indicate anything either. It turned out that the issue was "Safe mode is ON.Resources are low on NN. Safe mode must be turned off manually.", but ideally the user would be told that at the time of the failover.

      1. hdfs-3071.txt
        38 kB
        Todd Lipcon
      2. hdfs-3071.txt
        35 kB
        Todd Lipcon
      3. hdfs-3071.txt
        34 kB
        Todd Lipcon
      4. hdfs-3071.txt
        35 kB
        Todd Lipcon
      5. hdfs-3071.txt
        34 kB
        Todd Lipcon
      6. hdfs-3071.txt
        11 kB
        Todd Lipcon

        Issue Links

          Activity

            People

            • Assignee:
              Todd Lipcon
              Reporter:
              Philip Zeyliger
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development