Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-3397

Failed streaming jobs should fall back to the most recent checkpoint/savepoint

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.0.0
    • None
    • None

    Description

      The current fallback behaviour in case of a streaming job failure is slightly counterintuitive:

      If a job fails it will fall back to the most recent checkpoint (if any) even if there were more recent savepoint taken. This means that savepoints are not regarded as checkpoints by the system only points from where a job can be manually restarted.

      I suggest to change this so that savepoints are also regarded as checkpoints in case of a failure and they will also be used to automatically restore the streaming job.

      Attachments

        1. FLINK-3397.pdf
          117 kB
          ramkrishna.s.vasudevan

        Issue Links

          Activity

            People

              Unassigned Unassigned
              gyfora Gyula Fora
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: