Details
- Bug
- Status: Resolved
- Critical
- Resolution: Fixed
- None
- Reviewed
Description
- Submit 10K applications to the default queue.
- All applications are in the ACCEPTED state.
- Now restart the ResourceManager.
For each application recovered, LeafQueue#activateApplications() is invoked, so the AM limit check runs even before the NodeManagers have registered.
Recovering N applications takes about N(N+1)/2 iterations in total. For 10K applications that is roughly 50,000,000 iterations, so the RM takes more than 10 minutes to become active.
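The iteration count can be checked with a small stand-alone sketch. It assumes a simplified model in which each recovered application triggers one pass over all applications recovered so far. The class and method names are hypothetical and are not part of Hadoop.

```java
public class RecoveryIterationCount {
    // Assumed model: recovering the k-th application triggers one pass
    // over the k applications that are pending at that point.
    static long countIterations(int applications) {
        long iterations = 0;
        for (int recovered = 1; recovered <= applications; recovered++) {
            iterations += recovered;
        }
        return iterations;
    }

    // Closed form N(N+1)/2, computed in long arithmetic to avoid int overflow.
    static long closedForm(int applications) {
        return (long) applications * (applications + 1) / 2;
    }

    public static void main(String[] args) {
        int n = 10_000;
        System.out.println(countIterations(n)); // 50005000
        System.out.println(closedForm(n));      // 50005000
    }
}
```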
Since NM resources have not yet been added during recovery, we should skip activateApplications().
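A minimal sketch of the proposed skip, using a toy queue rather than the real LeafQueue. Everything here is an assumption made for illustration: the ToyLeafQueue class, the 1024 MB AM size, the 10% AM share, and the "cluster memory is zero" guard. The actual fix may detect recovery differently.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

public class SkipActivationSketch {
    static final long AM_MEMORY_MB = 1024; // assumed size of each AM container

    /** Toy stand-in for a leaf queue; NOT the Hadoop LeafQueue class. */
    static class ToyLeafQueue {
        final List<String> pending = new ArrayList<>();
        final boolean skipWhenClusterEmpty;
        long clusterMemoryMb = 0;  // stays 0 until a node registers
        long usedAmMemoryMb = 0;
        long amLimitChecks = 0;    // number of per-application limit checks
        int activated = 0;

        ToyLeafQueue(boolean skipWhenClusterEmpty) {
            this.skipWhenClusterEmpty = skipWhenClusterEmpty;
        }

        void recoverApplication(String appId) {
            pending.add(appId);
            activateApplications();
        }

        void addNode(long memoryMb) {
            clusterMemoryMb += memoryMb;
            activateApplications();
        }

        void activateApplications() {
            // Proposed guard: with no node resources nothing can be activated,
            // so skip the pass over the pending applications entirely.
            if (skipWhenClusterEmpty && clusterMemoryMb == 0) {
                return;
            }
            long amLimitMb = clusterMemoryMb / 10; // assumed 10% AM share
            Iterator<String> it = pending.iterator();
            while (it.hasNext()) {
                it.next();
                amLimitChecks++;
                if (usedAmMemoryMb + AM_MEMORY_MB <= amLimitMb) {
                    usedAmMemoryMb += AM_MEMORY_MB;
                    activated++;
                    it.remove();
                }
            }
        }
    }

    // Recovers the given number of applications with no nodes registered
    // and returns how many AM limit checks were performed.
    static long checksDuringRecovery(int applications, boolean skip) {
        ToyLeafQueue queue = new ToyLeafQueue(skip);
        for (int i = 0; i < applications; i++) {
            queue.recoverApplication("app_" + i);
        }
        return queue.amLimitChecks;
    }

    public static void main(String[] args) {
        System.out.println(checksDuringRecovery(10_000, false)); // 50005000
        System.out.println(checksDuringRecovery(10_000, true));  // 0
    }
}
```

In this toy model the guarded queue does no work during recovery and performs a single pass over the pending applications once the first node registers through addNode().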
Attachments
Issue Links
- breaks
- YARN-5994 TestCapacityScheduler.testAMLimitUsage fails intermittently
- Resolved