Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
0.23.3, 0.24.0
-
None
-
Reviewed
Description
The ZKFC doesn't properly handle the case where the monitored service fails to become active. Currently, it catches the exception and logs a warning, but then continues on, after calling quitElection(). This causes a NPE when it later tries to use the same zkClient instance while handling that same request. There is a test case, but the test case doesn't ensure that the node that had the failure is later able to recover properly.