Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-12619

Support TERMINATE/SUSPEND Job with Checkpoint

    XMLWordPrintableJSON

Details

    Description

      Inspired by the idea of FLINK-11458, we propose to support terminate/suspend a job with checkpoint. This improvement cooperates with incremental and external checkpoint features, that if checkpoint is retained and this feature is configured, we will trigger a checkpoint before the job stops. It could accelarate job recovery a lot since:
      1. No source rewinding required any more.
      2. It's much faster than taking a savepoint since incremental checkpoint is enabled.

      Please note that conceptually savepoints is different from checkpoint in a similar way that backups are different from recovery logs in traditional database systems. So we suggest using this feature only for job recovery, while stick with FLINK-11458 for the upgrading/cross-cluster-job-migration/state-backend-switch cases.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              klion26 Congxian Qiu
              Votes:
              1 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 50m
                  50m