Details
- Type: Improvement
- Status: Resolved
- Priority: Major
- Resolution: Duplicate
Description
When work-preserving restart is enabled, it appears that the restart (or failover) is unconditionally blocked for the full configured delay, even if the recovery itself finishes sooner. The wait should instead end at the earlier of the two conditions: recovery completing or the configured delay elapsing. It would also be nice to allow setting the config to -1 to mean "wait as long as needed for the recovery to complete".
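A minimal sketch of the requested wait semantics, not the actual ResourceManager code. The class `RecoveryWait`, the method `awaitRecovery`, and the use of a `CountDownLatch` to signal that recovery has finished are all hypothetical names chosen for illustration; only the behaviour (return at the earlier of recovery completion or the delay, with a negative delay meaning no upper bound) comes from the description above.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class RecoveryWait {

    /**
     * Blocks until recovery completes or maxWaitMs elapses, whichever comes first.
     * A negative maxWaitMs (for example -1) means: wait for recovery with no upper bound.
     *
     * @return true if recovery completed, false if the delay expired first
     */
    public static boolean awaitRecovery(CountDownLatch recoveryDone, long maxWaitMs)
            throws InterruptedException {
        if (maxWaitMs < 0) {
            // Hypothetical "-1" setting: block until recovery signals completion.
            recoveryDone.await();
            return true;
        }
        // Returns as soon as the latch reaches zero, or after maxWaitMs at the latest.
        return recoveryDone.await(maxWaitMs, TimeUnit.MILLISECONDS);
    }

    public static void main(String[] args) throws InterruptedException {
        CountDownLatch recoveryDone = new CountDownLatch(1);

        // Simulate a recovery that finishes well before the configured delay.
        Thread recovery = new Thread(() -> {
            try {
                Thread.sleep(50);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
            recoveryDone.countDown();
        });
        recovery.start();

        long start = System.nanoTime();
        boolean completed = awaitRecovery(recoveryDone, 10_000);
        long elapsedMs = TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start);

        System.out.println("recovery completed: " + completed + ", waited ms: " + elapsedMs);
        recovery.join();
    }
}
```

With this shape, the caller is unblocked shortly after the simulated recovery finishes rather than after the full 10-second delay, and passing a negative delay removes the timeout entirely.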
Issue Links
- duplicates YARN-2567 Add a percentage-node threshold for RM to wait for new allocations after restart/failover (Open)