Details
-
Bug
-
Status: Resolved
-
Blocker
-
Resolution: Fixed
-
1.19.0, 1.18.1, 1.18.2
-
Fixes a bug where the leader election wasn't able to pick up leadership again after renewing the lease token caused a leadership loss. This required fabric8io:kubernetes-client to be upgraded from v6.6.2 to v6.9.0.
Description
The observation is that Job manager goes to suspend state with a failed container not able to register itself to resource manager after timeout.
JM Log, see attached
Attachments
Attachments
Issue Links
- causes
-
FLINK-34243 Update fabric8io:kubernetes-client to v6.9.0+ to allow the usage of ReleaseOnCall
- Closed
- split to
-
FLINK-34333 Fix FLINK-34007 LeaderElector bug in 1.18
- Resolved
- Testing discovered
-
FLINK-34343 ResourceManager registration is not completed when registering the JobMaster
- Closed
- links to
- mentioned in
-
Page Loading...
(1 links to, 1 mentioned in)