Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-3071

haadmin failover command does not provide enough detail for when target NN is not ready to be active

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.0.0-alpha
    • 2.0.0-alpha
    • ha
    • None
    • Reviewed

    Description

      When running the failover command, you can get an error message like the following:

      $ hdfs --config $(pwd) haadmin -failover namenode2 namenode1
      Failover failed: xxx.yyy/1.2.3.4:8020 is not ready to become active

      Unfortunately, the error message doesn't describe why that node isn't ready to be active. In my case, the target namenode's logs don't indicate anything either. It turned out that the issue was "Safe mode is ON.Resources are low on NN. Safe mode must be turned off manually.", but ideally the user would be told that at the time of the failover.

      Attachments

        1. hdfs-3071.txt
          11 kB
          Todd Lipcon
        2. hdfs-3071.txt
          34 kB
          Todd Lipcon
        3. hdfs-3071.txt
          35 kB
          Todd Lipcon
        4. hdfs-3071.txt
          34 kB
          Todd Lipcon
        5. hdfs-3071.txt
          35 kB
          Todd Lipcon
        6. hdfs-3071.txt
          38 kB
          Todd Lipcon

        Issue Links

          Activity

            People

              tlipcon Todd Lipcon
              philip Philip Martin
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: