Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
The contract of `MasterTriggerRestoreHook#triggerCheckpoint` (@PublicEvolving api) has changed - we now require it to be non-blocking. Any heavy operation should be executed asynchronously, for example using the given executor.
Description
Currently we snapshot the master hook state synchronously. As a part of reworking threading model of CheckpointCoordinator, we have to make this non-blocking to satisfy the requirement of running in main thread.
The behavior of snapshotting master hook state is similar to task state snapshotting. Master state snapshotting is taken before task state snapshotting. Because in master hook, there might be external system initialization which task state snapshotting might depend on.
Attachments
Issue Links
- links to