Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Duplicate
-
1.5.0, 1.6.0
-
None
Description
Currently, we use the RpcService#scheduledExecutor to send heartbeat requests to remote targets. This has the problem that we still see heartbeats from this endpoint also if its main thread is currently blocked. Due to this, the heartbeat response cannot be processed and the remote target times out. On the remote side, this won't be noticed because it still receives the heartbeat requests.
A solution to this problem would be to send the heartbeat requests to the remote thread through the RPC endpoint's main thread. That way, also the heartbeats would be blocked if the main thread is blocked/busy.
Attachments
Issue Links
- duplicates
-
FLINK-12863 Race condition between slot offerings and AllocatedSlotReport
-
- Closed
-
- links to