Details
Type: Improvement
Status: Resolved
Priority: Critical
Resolution: Fixed
1.13.2, 1.14.4, 1.15.0
Description
Currently, when the method "SplitEnumeratorContext.runInCoordinatorThread()" throws an exception, the JobManager process is killed.
The chain of events leading to the process kill is:
- An exception bubbles up in the executor, killing the executor thread.
- The executor starts a replacement thread, which is forbidden by the thread factory (as a safety net) and causes a process kill.
We should prevent such exceptions from bubbling up in the coordinator executor.
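One possible way to keep such exceptions from reaching the executor is to wrap every submitted task in a guard that catches the failure and hands it to a handler. The sketch below is illustrative only and is not the actual fix: the class GuardedCoordinatorExecutor, the method guarded() and the failure handler are hypothetical names, and a plain single-threaded java.util.concurrent executor stands in for the coordinator executor.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Consumer;

/** Hypothetical sketch; none of these names come from the Flink code base. */
public class GuardedCoordinatorExecutor {

    /**
     * Wraps a task so that any Throwable it throws is passed to the handler
     * instead of propagating into the executor's worker thread.
     */
    public static Runnable guarded(Runnable task, Consumer<Throwable> failureHandler) {
        return () -> {
            try {
                task.run();
            } catch (Throwable t) {
                // Assumption: a real handler would fail the job or the
                // coordinator here rather than just record the error.
                failureHandler.accept(t);
            }
        };
    }

    public static void main(String[] args) throws InterruptedException {
        // Stand-in for the coordinator executor (assumption).
        ExecutorService executor = Executors.newSingleThreadExecutor();
        AtomicReference<Throwable> reported = new AtomicReference<>();
        AtomicReference<String> firstThread = new AtomicReference<>();
        AtomicReference<String> secondThread = new AtomicReference<>();

        // The first task fails, but the guard swallows the exception.
        executor.execute(guarded(() -> {
            firstThread.set(Thread.currentThread().getName());
            throw new IllegalStateException("boom");
        }, reported::set));

        // The second task records which thread it ran on.
        executor.execute(guarded(
                () -> secondThread.set(Thread.currentThread().getName()),
                reported::set));

        executor.shutdown();
        executor.awaitTermination(5, TimeUnit.SECONDS);

        System.out.println("Reported failure: " + reported.get());
        System.out.println("Same worker thread reused: "
                + firstThread.get().equals(secondThread.get()));
    }
}
```

Because the exception never leaves the wrapped Runnable, the worker thread survives and the executor has no reason to ask the thread factory for a replacement. Catching Throwable also intercepts Errors, so whether the handler should rethrow fatal errors is a design decision this sketch leaves open.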