Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
1.1.4, 1.2.0, 1.3.0
-
None
Description
The ZooKeeperCompletedCheckpointStore reads a set of RetrievableStateHandles from ZooKeeper upon recovery. It then tries to retrieve the CompletedCheckpoint from the latest state handle. If the retrieve operation fails, then the whole recovery of completed checkpoints fails even though the store might have read older state handles from ZooKeeper.
I propose to harden the behaviour by removing broken state handles and returning the first successfully retrieved CompletedCheckpoint.