We recently hit the NodeManager OOM issue reported in YARN-8242 during a rolling upgrade. Our code is currently based on branch-2.8, but we are in the process of moving to 2.10. I checked, and YARN-8242 backports to branch-2.10 fairly cleanly. The only conflict was a minor one in TestNMLeveldbStateStoreService.java.
- relates to YARN-8242: YARN NM: OOM error while reading back the state store on recovery