Hadoop Common
HADOOP-8220

ZKFailoverController doesn't handle failure to become active correctly

    Details

    • Hadoop Flags:
      Reviewed

      Description

      The ZKFC doesn't properly handle the case where the monitored service fails to become active. Currently, it catches the exception, logs a warning, and calls quitElection(), but then continues on. This causes an NPE when it later tries to use the same zkClient instance while handling that same request. There is a test case, but it doesn't ensure that the node that had the failure is later able to recover properly.
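
      The failure mode described above can be illustrated with a minimal, self-contained sketch. It is not the ZKFC code or the attached patch: ZkfcSketch, FakeZkClient, Service, Elector, onLockAcquiredBuggy and onLockAcquiredSafer are hypothetical names, and the sketch assumes that quitElection() drops the client reference, which is one plausible way to arrive at the NPE. The "safer" variant shows one possible shape of a fix (stop handling the request once the election has been quit, and rejoin later), not necessarily the approach taken here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical, simplified stand-ins. None of these types are the real Hadoop classes.
public class ZkfcSketch {

    /** Stand-in for the ZooKeeper client handle. */
    static class FakeZkClient {
        boolean nodeExists(String path) { return false; }
    }

    /** Stand-in for the monitored service (for example, a NameNode). */
    interface Service {
        void transitionToActive() throws Exception;
    }

    static class Elector {
        private FakeZkClient zkClient = new FakeZkClient();
        private final Service service;
        final List<String> log = new ArrayList<>();

        Elector(Service service) { this.service = service; }

        // Assumption: quitting the election discards the client reference.
        void quitElection() { zkClient = null; log.add("quit"); }
        void joinElection() { zkClient = new FakeZkClient(); log.add("join"); }
        boolean inElection() { return zkClient != null; }

        /** Buggy shape: log the failure, quit, then keep handling the request. */
        void onLockAcquiredBuggy() {
            try {
                service.transitionToActive();
            } catch (Exception e) {
                log.add("warn: " + e.getMessage());
                quitElection();
            }
            // Same request, but zkClient is null if the transition failed.
            zkClient.nodeExists("/lock");
        }

        /** Safer shape: stop handling the request once the election was quit. */
        boolean onLockAcquiredSafer() {
            try {
                service.transitionToActive();
            } catch (Exception e) {
                log.add("warn: " + e.getMessage());
                quitElection();
                return false; // caller may rejoin the election later
            }
            zkClient.nodeExists("/lock");
            return true;
        }
    }

    public static void main(String[] args) {
        boolean[] fail = { true };
        Elector elector = new Elector(() -> {
            if (fail[0]) throw new IllegalStateException("cannot become active");
        });
        try {
            elector.onLockAcquiredBuggy();
        } catch (NullPointerException npe) {
            System.out.println("buggy path: NPE after quitElection()");
        }
        elector.joinElection();
        System.out.println("safer path became active: " + elector.onLockAcquiredSafer());
        elector.joinElection();
        fail[0] = false; // the service has recovered
        System.out.println("after recovery became active: " + elector.onLockAcquiredSafer());
    }
}
```

      The last two calls in main cover the gap noted in the description: after a failed transition, the same node rejoins the election and is checked for its ability to become active once the service recovers.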

      1. hadoop-8220.txt
        9 kB
        Todd Lipcon
      2. hadoop-8220.txt
        12 kB
        Todd Lipcon
      3. hadoop-8220.txt
        12 kB
        Todd Lipcon
      4. hadoop-8220.txt
        15 kB
        Todd Lipcon
      5. hadoop-8220.txt
        15 kB
        Todd Lipcon

        Activity

        Field: Original Value -> New Value

        Todd Lipcon made changes -
        Status: Patch Available [ 10002 ] -> Resolved [ 5 ]
        Hadoop Flags: (none) -> Reviewed [ 10343 ]
        Fix Version/s: (none) -> Auto Failover (HDFS-3042) [ 12320350 ]
        Resolution: (none) -> Fixed [ 1 ]
        Todd Lipcon made changes -
        Attachment: (none) -> hadoop-8220.txt [ 12520652 ]
        Todd Lipcon made changes -
        Attachment: (none) -> hadoop-8220.txt [ 12520563 ]
        Todd Lipcon made changes -
        Attachment: (none) -> hadoop-8220.txt [ 12520520 ]
        Todd Lipcon made changes -
        Attachment: (none) -> hadoop-8220.txt [ 12520334 ]
        Todd Lipcon made changes -
        Target Version/s: 0.24.0, 0.23.3 [ 12317652, 12320059 ] -> Auto Failover (HDFS-3042) [ 12320350 ]
        Component/s: (none) -> auto-failover [ 12317908 ]
        Todd Lipcon made changes -
        Status: Open [ 1 ] -> Patch Available [ 10002 ]
        Target Version/s: 0.24.0, 0.23.3 [ 12317652, 12320059 ] -> 0.23.3, 0.24.0 [ 12320059, 12317652 ]
        Todd Lipcon made changes -
        Attachment: (none) -> hadoop-8220.txt [ 12520062 ]
        Todd Lipcon created issue -

          People

          • Assignee:
            Todd Lipcon
            Reporter:
            Todd Lipcon
          • Votes:
            0
            Watchers:
            4