FLINK-5940

ZooKeeperCompletedCheckpointStore cannot handle broken state handles


Details

    Description

      Upon recovery, the ZooKeeperCompletedCheckpointStore reads a set of RetrievableStateHandles from ZooKeeper and then tries to retrieve the CompletedCheckpoint from the latest state handle. If that retrieval fails, the whole recovery of completed checkpoints fails, even though the store might have read older state handles from ZooKeeper that could still be retrieved.

      I propose to harden the behaviour by removing broken state handles and returning the first successfully retrieved CompletedCheckpoint.
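
The proposed behaviour can be sketched roughly as follows. This is only an illustration, not the actual Flink implementation: `StateHandle` is a hypothetical stand-in for RetrievableStateHandle, `recoverLatest` is an invented helper name, and the list is assumed to be ordered from newest to oldest. The real store would also have to remove the broken handle from ZooKeeper, which is omitted here.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Optional;

public class CheckpointRecoverySketch {

    /** Hypothetical stand-in for Flink's RetrievableStateHandle. */
    interface StateHandle<T> {
        T retrieve() throws Exception;
    }

    /**
     * Tries the handles in order (assumed newest first), removes every
     * handle whose retrieval fails, and returns the first value that
     * could be retrieved. Returns an empty Optional if all handles fail.
     */
    static <T> Optional<T> recoverLatest(List<StateHandle<T>> newestFirst) {
        Iterator<StateHandle<T>> it = newestFirst.iterator();
        while (it.hasNext()) {
            StateHandle<T> handle = it.next();
            try {
                return Optional.of(handle.retrieve());
            } catch (Exception e) {
                // Broken handle: drop it and fall back to the next older one.
                it.remove();
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        List<StateHandle<String>> handles = new ArrayList<>();
        // Newest handle is broken, the two older ones are intact.
        handles.add(() -> { throw new java.io.IOException("corrupt state"); });
        handles.add(() -> "checkpoint-41");
        handles.add(() -> "checkpoint-40");

        Optional<String> recovered = recoverLatest(handles);
        System.out.println(recovered.orElse("none")); // prints "checkpoint-41"
        System.out.println(handles.size());           // prints "2"
    }
}
```

With this approach a single unreadable handle no longer aborts recovery; recovery only comes back empty when no handle can be retrieved at all.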


          People

            trohrmann Till Rohrmann
