Details
- Improvement
- Status: Closed
- Major
- Resolution: Duplicate
- 1.1.3
- None
- None
Description
This should act as a safety net in cases where the sources or the network misbehave or cannot start a checkpoint. The alignment phase would then simply abort after a while and signal that the checkpoint is incomplete.
This needs the following changes:
- Add "cancel barriers" that are sent to downstream tasks
- Barrier Buffer and Barrier Tracker need to handle the "cancel barriers"
- Tasks need to react to an aborted checkpoint by responding with an appropriate message
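The three changes above can be illustrated with a minimal Java sketch. It is not Flink's actual Barrier Buffer or Barrier Tracker code: the class `AlignmentSketch`, its methods, the `Result` enum, and the timeout parameter are all hypothetical names chosen for illustration. It assumes that checkpoint IDs increase monotonically and that the caller supplies the current time in milliseconds.

```java
import java.util.HashSet;
import java.util.Set;

/** Hypothetical sketch of alignment with cancel barriers; not Flink's implementation. */
class AlignmentSketch {
    enum Result { ALIGNING, COMPLETED, ABORTED, IGNORED }

    private final int numChannels;
    private final long timeoutMillis;
    private final Set<Integer> blocked = new HashSet<>(); // channels that delivered a barrier
    private long currentCheckpointId = -1;
    private long lastFinishedId = -1; // highest checkpoint that completed or aborted
    private long alignmentStart;

    AlignmentSketch(int numChannels, long timeoutMillis) {
        this.numChannels = numChannels;
        this.timeoutMillis = timeoutMillis;
    }

    /** Handles a regular checkpoint barrier arriving on one input channel. */
    Result onBarrier(long checkpointId, int channel, long nowMillis) {
        if (checkpointId <= lastFinishedId || checkpointId < currentCheckpointId) {
            return Result.IGNORED; // late barrier of a finished or superseded checkpoint
        }
        if (checkpointId > currentCheckpointId) {
            // A newer checkpoint starts a fresh alignment.
            blocked.clear();
            currentCheckpointId = checkpointId;
            alignmentStart = nowMillis;
        }
        if (nowMillis - alignmentStart > timeoutMillis) {
            return abort(checkpointId); // safety net: alignment took too long
        }
        blocked.add(channel);
        if (blocked.size() == numChannels) {
            blocked.clear();
            lastFinishedId = checkpointId;
            return Result.COMPLETED;
        }
        return Result.ALIGNING;
    }

    /** Handles a "cancel barrier" sent by an upstream task. */
    Result onCancelBarrier(long checkpointId) {
        if (checkpointId <= lastFinishedId || checkpointId < currentCheckpointId) {
            return Result.IGNORED;
        }
        return abort(checkpointId);
    }

    // Releases all blocked channels and records the checkpoint as finished.
    // A real task would also forward the cancel barrier downstream and
    // notify the coordinator that the checkpoint was aborted.
    private Result abort(long checkpointId) {
        blocked.clear();
        lastFinishedId = Math.max(lastFinishedId, checkpointId);
        return Result.ABORTED;
    }
}
```

In this sketch the timeout is only checked when a barrier arrives; a real implementation would presumably also need a timer so that alignment aborts even when no further barriers come in.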
Issue Links
- duplicates FLINK-4975 Add a limit for how much data may be buffered during checkpoint alignment (Closed)