Details
Description
Some times the TestResourceTrackerService.testNodeRemovalNormally fails with the following message
java.lang.AssertionError: Shutdown nodes should be 0 now expected:<1> but was:<0> at org.apache.hadoop.yarn.server.resourcemanager.TestResourceTrackerService.testNodeRemovalUtilDecomToUntracked(TestResourceTrackerService.java:1723) at org.apache.hadoop.yarn.server.resourcemanager.TestResourceTrackerService.testNodeRemovalUtil(TestResourceTrackerService.java:1685) at org.apache.hadoop.yarn.server.resourcemanager.TestResourceTrackerService.testNodeRemovalNormally(TestResourceTrackerService.java:1530)
This can happen in case if the hardcoded 1s sleep in the test not enough for proper shut down.
To fix this issue we should poll the cluster status with a time out, and see the cluster can reach the expected state
Attachments
Issue Links
- links to