Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-3071

haadmin failover command does not provide enough detail for when target NN is not ready to be active

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.0-alpha
    • Fix Version/s: 2.0.0-alpha
    • Component/s: ha
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      When running the failover command, you can get an error message like the following:

      $ hdfs --config $(pwd) haadmin -failover namenode2 namenode1
      Failover failed: xxx.yyy/1.2.3.4:8020 is not ready to become active

      Unfortunately, the error message doesn't describe why that node isn't ready to be active. In my case, the target namenode's logs don't indicate anything either. It turned out that the issue was "Safe mode is ON.Resources are low on NN. Safe mode must be turned off manually.", but ideally the user would be told that at the time of the failover.

        Attachments

        1. hdfs-3071.txt
          38 kB
          Todd Lipcon
        2. hdfs-3071.txt
          35 kB
          Todd Lipcon
        3. hdfs-3071.txt
          34 kB
          Todd Lipcon
        4. hdfs-3071.txt
          35 kB
          Todd Lipcon
        5. hdfs-3071.txt
          34 kB
          Todd Lipcon
        6. hdfs-3071.txt
          11 kB
          Todd Lipcon

          Issue Links

            Activity

              People

              • Assignee:
                tlipcon Todd Lipcon
                Reporter:
                philip Philip Zeyliger
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: