Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
Reviewed
Description
Recently RM full GC happened in one of our clusters, after investigating the dump memory and jstack, I found two places in RM may cause memory leaks after RM transitioned to standby:
- Release cache cleanup timer in AbstractYarnScheduler never be canceled.
- ZooKeeper connection in ZKRMStateStore never be closed.
To solve those leaks, we should close the connection or cancel the timer when services are stopping.