Details
- Type: Improvement
- Status: Closed
- Priority: Critical
- Resolution: Fixed
- 1.10.0, 1.11.0
Description
To harden the communication between the TaskManager (TM) and the JobManager (JM), I suggest letting the TaskExecutor send its task statuses back to the JobMaster as part of the heartbeat payload (similar to FLINK-11059). This would allow the states of both components to be reconciled if a status update message is lost, as described by a user on the mailing list.
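To illustrate the idea, here is a minimal sketch of how a status report carried in the heartbeat could be compared against the JobMaster's view. It is not Flink's actual implementation: the class `HeartbeatReconciliation`, the `TaskState` enum, the `Result` type, and the `reconcile` method are all hypothetical names chosen for this example, and task IDs are modelled as plain strings.

```java
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

/** Hypothetical sketch; none of these names are taken from the Flink code base. */
final class HeartbeatReconciliation {

    /** Simplified task states assumed for this example. */
    enum TaskState { DEPLOYING, RUNNING, FINISHED, FAILED }

    /** Outcome of comparing the JobMaster's view with one heartbeat report. */
    static final class Result {
        /** JobMaster view after adopting the states reported by the TaskExecutor. */
        final Map<String, TaskState> updatedView = new TreeMap<>();
        /** Tasks the JobMaster tracks but the TaskExecutor did not report. */
        final Set<String> missingOnExecutor = new TreeSet<>();
        /** Tasks the TaskExecutor reported but the JobMaster does not track. */
        final Set<String> unknownToJobMaster = new TreeSet<>();
    }

    /**
     * Reconciles the JobMaster's view with the statuses from a heartbeat payload.
     * A state that differs in the report is adopted, which recovers from a lost
     * status update message. Mismatched task sets are only collected here; how
     * to react to them (fail, cancel, wait) is left to the caller.
     */
    static Result reconcile(
            Map<String, TaskState> jobMasterView,
            Map<String, TaskState> heartbeatReport) {
        Result result = new Result();
        for (Map.Entry<String, TaskState> tracked : jobMasterView.entrySet()) {
            TaskState reported = heartbeatReport.get(tracked.getKey());
            if (reported == null) {
                // Not reported: keep the old state and flag the task.
                result.missingOnExecutor.add(tracked.getKey());
                result.updatedView.put(tracked.getKey(), tracked.getValue());
            } else {
                // Reported: the TaskExecutor's state wins.
                result.updatedView.put(tracked.getKey(), reported);
            }
        }
        for (String taskId : heartbeatReport.keySet()) {
            if (!jobMasterView.containsKey(taskId)) {
                result.unknownToJobMaster.add(taskId);
            }
        }
        return result;
    }
}
```

For example, if the JobMaster still believes a task is `DEPLOYING` because the `RUNNING` update was lost, the next heartbeat report corrects the state. The sketch deliberately does not act on missing or unknown tasks, since a task may legitimately be absent from a report while its deployment is still in flight.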
Issue Links
- causes: FLINK-18533 Race condition between task acknowledgement and first heartbeat (Closed)
- is related to: FLINK-19954 Move execution deployment tracking logic from legacy EG code to SchedulerNG (Open)