-
Type:
Bug
-
Status: Resolved
-
Priority:
Major
-
Resolution: Fixed
-
Affects Version/s: None
-
Component/s: resourcemanager
-
Labels:
-
Target Version/s:
-
Hadoop Flags:Reviewed
Recently RM full GC happened in one of our clusters, after investigating the dump memory and jstack, I found two places in RM may cause memory leaks after RM transitioned to standby:
- Release cache cleanup timer in AbstractYarnScheduler never be canceled.
- ZooKeeper connection in ZKRMStateStore never be closed.
To solve those leaks, we should close the connection or cancel the timer when services are stopping.