Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
Reviewed
Description
Noticed some unit tests, for example:
TestRMRestart#testRMRestartAfterPreemption
TestAMRestart#testShouldNotCountFailureToMaxAttemptRetry
Sometimes failure because retrying app attempt registers before the previous scheduler-application-attempt completely completed in scheduler.
We need to wait scheduler-application-attempt stop before retrying following attempts.
Attachments
Attachments
Issue Links
- relates to
-
YARN-4497 RM might fail to restart when recovering apps whose attempts are missing
- Resolved