Details
Bug
Status: Resolved
Major
Resolution: Fixed
3.0.0-beta1
None
Reviewed
Description
Many of the RM unit tests are failing when using the FairScheduler.
Here is a list of affected test classes:
- TestYarnClient
- TestApplicationCleanup
- TestApplicationMasterLauncher
- TestDecommissioningNodesWatcher
- TestKillApplicationWithRMHA
- TestNodeBlacklistingOnAMFailures
- TestRM
- TestRMAdminService
- TestRMRestart
- TestResourceTrackerService
- TestWorkPreservingRMRestart
- TestAMRMRPCNodeUpdates
- TestAMRMRPCResponseId
- TestAMRestart
- TestApplicationLifetimeMonitor
- TestNodesListManager
- TestRMContainerImpl
- TestAbstractYarnScheduler
- TestSchedulerUtils
- TestFairOrderingPolicy
- TestAMRMTokens
- TestDelegationTokenRenewer
Most of the test methods in these classes are failing, though some do succeed.
There are two main categories of failure:
- The test submits an application to the MockRM and waits for it to enter a specific state. The application never reaches that state, so the test times out. We need to call update() on the scheduler.
- The test throws a ClassCastException when casting FSQueueMetrics to CSQueueMetrics. This happens because the QueueMetrics instances are held statically: a previous test using FairScheduler initialized them, and the current test, which uses CapacityScheduler, gets the stale FSQueueMetrics back. We need to reset the metrics between tests.
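
The second category can be reproduced in miniature. The following is only a toy model of a statically cached metrics object, not Hadoop code: `BaseMetrics`, `FairMetrics`, `CapacityMetrics`, and `clear()` are hypothetical stand-ins for QueueMetrics, FSQueueMetrics, CSQueueMetrics, and whatever reset the tests end up calling. It assumes the cache is keyed by queue name and shared by every test in the same JVM.

```java
import java.util.HashMap;
import java.util.Map;

// Toy model only: every class here is a hypothetical stand-in, not a Hadoop class.
class StaticMetricsDemo {

    static class BaseMetrics {
        // Static cache: survives from one test to the next in the same JVM.
        static final Map<String, BaseMetrics> CACHE = new HashMap<>();

        // The reset that a test setup or teardown step would need to call.
        static void clear() {
            CACHE.clear();
        }
    }

    static class FairMetrics extends BaseMetrics {
        static FairMetrics forQueue(String queue) {
            return (FairMetrics) CACHE.computeIfAbsent(queue, k -> new FairMetrics());
        }
    }

    static class CapacityMetrics extends BaseMetrics {
        static CapacityMetrics forQueue(String queue) {
            // Throws ClassCastException if a FairMetrics is already cached for this queue.
            return (CapacityMetrics) CACHE.computeIfAbsent(queue, k -> new CapacityMetrics());
        }
    }

    // Returns true if looking up capacity metrics for "root" hits a stale entry.
    static boolean capacityLookupFails() {
        try {
            CapacityMetrics.forQueue("root");
            return false;
        } catch (ClassCastException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        // "Previous test": runs with the fair-scheduler flavour of the metrics.
        FairMetrics.forQueue("root");

        // "Current test": expects the capacity-scheduler flavour, gets the stale object.
        System.out.println("stale cache fails: " + capacityLookupFails());  // true

        // Resetting the static state between tests removes the stale entry.
        BaseMetrics.clear();
        System.out.println("after reset fails: " + capacityLookupFails());  // false
    }
}
```

In this sketch the failure depends only on test ordering, which is consistent with some test methods passing while others in the same class fail.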