Details
Description
Steps to reproduce:
- Start YARN cluster - Enable Opportunistic containers and set NM queue length to something > 10. Also Enable work preserving restart
- Start an MR job (without opportunistic containers)
- Kill the NM and restart it again.
- In the logs - it shows that some of the containers are in SUSPENDED state - even though they are still running.