Details
-
Improvement
-
Status: Open
-
Not a Priority
-
Resolution: Unresolved
-
1.13.0
-
None
Description
Currently, we only fail the job when we received continues "CHECKPOINT_DECLINED" message, but ignored the "timeout"/"task_failure"/"task_checkpoint_failure"/"finalize_checkpoint_failure" and so on.
In my opinion, we should put some checkpoint failure reason above into account when failing a job (not only the "CHECKPOINT_DECLINED" reason"
This issue is inspired by a [user mail list|http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Making-job-fail-on-Checkpoint-Expired-tt34051.html],
Attachments
Issue Links
- is related to
-
FLINK-17351 CheckpointCoordinator and CheckpointFailureManager ignores checkpoint timeouts
- Closed