Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-8220

ZKFailoverController doesn't handle failure to become active correctly

    Details

      Description

      The ZKFC doesn't properly handle the case where the monitored service fails to become active. Currently, it catches the exception and logs a warning, but then continues on, after calling quitElection(). This causes a NPE when it later tries to use the same zkClient instance while handling that same request. There is a test case, but the test case doesn't ensure that the node that had the failure is later able to recover properly.

        Attachments

        1. hadoop-8220.txt
          9 kB
          Todd Lipcon
        2. hadoop-8220.txt
          12 kB
          Todd Lipcon
        3. hadoop-8220.txt
          12 kB
          Todd Lipcon
        4. hadoop-8220.txt
          15 kB
          Todd Lipcon
        5. hadoop-8220.txt
          15 kB
          Todd Lipcon

          Activity

            People

            • Assignee:
              tlipcon Todd Lipcon
              Reporter:
              tlipcon Todd Lipcon
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: