FLINK-19359

Restore from checkpoint fails if the checkpoint folder is corrupt/partial

Details

    • Type: Improvement
    • Status: Reopened
    • Priority: Not a Priority
    • Resolution: Unresolved
    • Affects Version/s: 1.8.0
    • None
    • None

    Description

      I'm using Flink 1.8.0 with externalized checkpoints enabled, writing to an HDFS location. We have seen a few scenarios where a checkpoint folder contains the checkpoint files but is missing only the "_metadata" file. If we attempt to restore the application from such a path, it fails with the exception "Could not find _metadata file". There is a similar discussion on the Flink user mailing list with the subject "Zookeeper connection loss causing checkpoint corruption". I've attached a sample snapshot showing what the folder structure looks like.
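
      As a possible workaround until this is handled in Flink itself, a pre-restore check along the following lines could skip partial checkpoint folders. This is only a minimal sketch under stated assumptions: it runs against a locally mounted or copied directory (an HDFS client would be needed for the real location), it relies on the usual "chk-<n>" folder naming, and the function name latest_complete_checkpoint is hypothetical. The presence of "_metadata" does not prove the checkpoint is otherwise intact.

```python
import re
from pathlib import Path
from typing import Optional

# Assumption: checkpoint folders follow the "chk-<n>" naming convention.
CHK_DIR = re.compile(r"^chk-(\d+)$")


def latest_complete_checkpoint(job_checkpoint_dir: Path) -> Optional[Path]:
    """Return the highest-numbered chk-<n> folder that contains _metadata.

    Hypothetical helper: it only inspects a local directory tree and
    treats a folder without a _metadata file as partial.
    """
    candidates = []
    for child in job_checkpoint_dir.iterdir():
        match = CHK_DIR.match(child.name)
        if match and child.is_dir():
            candidates.append((int(match.group(1)), child))

    # Walk from the newest checkpoint to the oldest and stop at the
    # first folder that has a _metadata file.
    for _, chk in sorted(candidates, key=lambda item: item[0], reverse=True):
        if (chk / "_metadata").is_file():
            return chk
    return None
```

      If the function returns a folder, that path could then be passed to "flink run -s <path>"; if it returns None, no folder under the given directory has a "_metadata" file and a restore from any of them would be expected to fail as described above.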

      Attachments

        1. Checkpoints.png
          12 kB
          Arpith Prakash

          People

            Assignee: Unassigned
            Reporter: Arpith Prakash (arpith_kp)
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated: