Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-4808 Allow skipping failed checkpoints
  3. FLINK-4810

Checkpoint Coordinator should fail ExecutionGraph after "n" unsuccessful checkpoints

    XMLWordPrintableJSON

Details

    Description

      The Checkpoint coordinator should track the number of consecutive unsuccessful checkpoints.

      If more than n (configured value) checkpoints fail in a row, it should call fail() on the execution graph to trigger a recovery.

      The design document is here : https://docs.google.com/document/d/1ce7RtecuTxcVUJlnU44hzcO2Dwq9g4Oyd8_biy94hJc/edit?usp=sharing

      Attachments

        Issue Links

          Activity

            People

              yanghua vinoyang
              sewen Stephan Ewen
              Votes:
              2 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: