Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-5940

ZooKeeperCompletedCheckpointStore cannot handle broken state handles

    Details

      Description

      The ZooKeeperCompletedCheckpointStore reads a set of RetrievableStateHandles from ZooKeeper upon recovery. It then tries to retrieve the CompletedCheckpoint from the latest state handle. If the retrieve operation fails, then the whole recovery of completed checkpoints fails even though the store might have read older state handles from ZooKeeper.

      I propose to harden the behaviour by removing broken state handles and returning the first successfully retrieved CompletedCheckpoint.

        Attachments

          Activity

            People

            • Assignee:
              till.rohrmann Till Rohrmann
              Reporter:
              till.rohrmann Till Rohrmann
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: