Details
Type: Improvement
Status: Closed
Priority: Major
Resolution: Won't Do
1.5.0
None
None
Description
The Dispatcher currently does not confirm leadership until all jobs have been recovered. As a result, any operation that requires an active Dispatcher is unavailable until job recovery has finished. This is done primarily to prevent race conditions between client retries and recovering jobs. An alternative approach would be to block only job submission while recovery is in progress.
Note: we would also need to verify that no other RPC changes the Dispatcher's internal state in a way that interferes with job recovery.
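
For illustration, the alternative could look roughly like the sketch below. It is not the actual Dispatcher code: the class RecoveryGatedDispatcher and the methods completeRecovery, submitJob and listJobs are hypothetical names, and job IDs are plain strings. The only idea it shows is that submissions are chained onto a recovery future, so a client retry is deduplicated against the recovered jobs, while read-only calls are answered immediately.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;

/** Hypothetical sketch of a dispatcher that gates only job submission on recovery. */
public class RecoveryGatedDispatcher {

    // Completed once all persisted jobs have been recovered (assumed hook).
    private final CompletableFuture<Void> recoveryFuture = new CompletableFuture<>();

    // IDs of the jobs known to this dispatcher.
    private final List<String> jobs = new ArrayList<>();

    /** Called by the (assumed) recovery logic: registers recovered jobs, then opens the gate. */
    public synchronized void completeRecovery(List<String> recoveredJobIds) {
        jobs.addAll(recoveredJobIds);
        // Pending submissions run from here, after the recovered jobs are visible.
        recoveryFuture.complete(null);
    }

    /** Submission is deferred until recovery is done; returns false for a duplicate job ID. */
    public CompletableFuture<Boolean> submitJob(String jobId) {
        return recoveryFuture.thenApply(ignored -> register(jobId));
    }

    /** Read-only call that does not wait for recovery. */
    public synchronized List<String> listJobs() {
        return new ArrayList<>(jobs);
    }

    private synchronized boolean register(String jobId) {
        if (jobs.contains(jobId)) {
            return false; // e.g. a client retry of a job that was just recovered
        }
        jobs.add(jobId);
        return true;
    }
}
```

In this sketch only submitJob touches the job list before recovery completes, and it does so through the gate. Any other state-changing RPC would need the same treatment, which is the concern raised in the note above.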