Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
1.14.0, 1.15.0
Description
After FLINK-23041, the downstream task can be switched UC when currentTime - triggerTime > timeout. But the downstream task still needs wait for all barriers of upstream.
If the back pressure is serve, the downstream task cannot receive all barrier within CP timeout, causes CP to fail.
Can we support upstream Task switching from Aligned to UC? It means that when the barrier cannot be sent from the output buffer to the downstream task within the execution.checkpointing.aligned-checkpoint-timeout, the upstream task switches to UC and takes a snapshot of the data before the barrier in the output buffer.
Hi akalashnikov , please help take a look in your free time, thanks a lot.
Attachments
Issue Links
- causes
-
FLINK-28077 Tasks get stuck during cancellation in ChannelStateWriteRequestExecutorImpl
- Closed
-
FLINK-31588 The unaligned checkpoint type is wrong at subtask level
- Closed
-
FLINK-31864 Generate unaligned checkpoint type based on persisted data
- Open
- relates to
-
FLINK-31588 The unaligned checkpoint type is wrong at subtask level
- Closed
- links to