Details
-
Bug
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
1.9.0, 1.10.0
-
None
Description
I think that we introduced with FLINK-12364 an inconsistency wrt to job termination a checkpointing. In FLINK-9900 it was discovered that checkpoints can complete even after the CheckpointFailureManager decided to fail a job. I think the expected behaviour should be that we fail all pending checkpoints once the CheckpointFailureManager decides to fail the job.
Attachments
Issue Links
- causes
-
FLINK-9900 Fix unstable test ZooKeeperHighAvailabilityITCase#testRestoreBehaviourWithFaultyStateHandles
- Closed
- is caused by
-
FLINK-13698 Rework threading model of CheckpointCoordinator
- Reopened
-
FLINK-16945 Execute CheckpointFailureManager.FailJobCallback directly in main thread executor
- Closed
- is fixed by
-
FLINK-16945 Execute CheckpointFailureManager.FailJobCallback directly in main thread executor
- Closed
- is related to
-
FLINK-5960 Make CheckpointCoordinator less blocking
- Closed
- relates to
-
FLINK-13527 Instable KafkaProducerExactlyOnceITCase due to CheckpointFailureManager
- Closed