Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
2.4.7, 3.0.1, 3.1.1, 3.2.0
None
Description
If the Worker fails to send ExecutorStateChanged to the Master because of an error, e.g., a temporary network failure, the Master cannot remove the finished executor and continues to treat it as alive. In the worst case, if that executor is the application's only executor, the application can hang.
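
To make the failure mode concrete, here is a minimal Python sketch of a bounded retry around a one-shot state notification. It is an illustration only, not the Spark change itself: `send_with_retry`, `FlakyMaster`, `executor_state_changed`, and `live_executors` are hypothetical names, and modelling the network error as `ConnectionError` is an assumption.

```python
import time
from typing import Callable


def send_with_retry(send: Callable[[], None],
                    max_attempts: int = 3,
                    delay_s: float = 0.0) -> bool:
    """Call `send` until it succeeds or `max_attempts` is exhausted.

    Returns True if the message was delivered, False otherwise.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            send()
            return True
        except ConnectionError:
            # Assumed transient failure: wait, then try again if attempts remain.
            if attempt < max_attempts:
                time.sleep(delay_s)
    return False


class FlakyMaster:
    """Hypothetical stand-in for the Master that drops the first `failures` messages."""

    def __init__(self, failures: int) -> None:
        self.failures = failures
        self.live_executors = {"exec-0"}

    def executor_state_changed(self, executor_id: str) -> None:
        if self.failures > 0:
            self.failures -= 1
            raise ConnectionError("temporary network error")
        # The message arrived, so the finished executor is no longer tracked as alive.
        self.live_executors.discard(executor_id)
```

With a single attempt, one dropped message leaves `exec-0` in `live_executors` indefinitely, which mirrors the stale-executor situation described above. With a few attempts, a short outage is absorbed, although a longer one still exhausts the retries.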