Hadoop Common
  1. Hadoop Common
  2. HADOOP-8220

ZKFailoverController doesn't handle failure to become active correctly

    Details

      Description

      The ZKFC doesn't properly handle the case where the monitored service fails to become active. Currently, it catches the exception and logs a warning, but then continues on, after calling quitElection(). This causes a NPE when it later tries to use the same zkClient instance while handling that same request. There is a test case, but the test case doesn't ensure that the node that had the failure is later able to recover properly.

      1. hadoop-8220.txt
        9 kB
        Todd Lipcon
      2. hadoop-8220.txt
        12 kB
        Todd Lipcon
      3. hadoop-8220.txt
        12 kB
        Todd Lipcon
      4. hadoop-8220.txt
        15 kB
        Todd Lipcon
      5. hadoop-8220.txt
        15 kB
        Todd Lipcon

        Activity

        Todd Lipcon created issue -
        Todd Lipcon made changes -
        Field Original Value New Value
        Attachment hadoop-8220.txt [ 12520062 ]
        Todd Lipcon made changes -
        Status Open [ 1 ] Patch Available [ 10002 ]
        Target Version/s 0.24.0, 0.23.3 [ 12317652, 12320059 ] 0.23.3, 0.24.0 [ 12320059, 12317652 ]
        Todd Lipcon made changes -
        Target Version/s 0.24.0, 0.23.3 [ 12317652, 12320059 ] Auto Failover (HDFS-3042) [ 12320350 ]
        Component/s auto-failover [ 12317908 ]
        Todd Lipcon made changes -
        Attachment hadoop-8220.txt [ 12520334 ]
        Todd Lipcon made changes -
        Attachment hadoop-8220.txt [ 12520520 ]
        Todd Lipcon made changes -
        Attachment hadoop-8220.txt [ 12520563 ]
        Todd Lipcon made changes -
        Attachment hadoop-8220.txt [ 12520652 ]
        Todd Lipcon made changes -
        Status Patch Available [ 10002 ] Resolved [ 5 ]
        Hadoop Flags Reviewed [ 10343 ]
        Fix Version/s Auto Failover (HDFS-3042) [ 12320350 ]
        Resolution Fixed [ 1 ]

          People

          • Assignee:
            Todd Lipcon
            Reporter:
            Todd Lipcon
          • Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development