Details
Type: Bug
Status: Closed
Priority: Blocker
Resolution: Fixed
Reviewed
Description
When running about 100 moderately sized jobs on a small cluster (with 19 Task Trackers),
all of the task trackers died due to OOM.
I managed to capture a jstack thread dump of one task tracker before it died.
Its image size was close to 4 GB!
The dump showed 1200+ DFSClient.LeaseChecker threads.
Clearly we have a severe resource leak!
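
For anyone who wants to check their own task trackers for the same symptom, here is a rough sketch of how the thread count could be pulled out of a saved jstack dump. It assumes that thread entries in the dump are separated by blank lines and that LeaseChecker threads show a stack frame containing `DFSClient$LeaseChecker` (the JVM's name for the nested class). The class name `ThreadDumpCounter` and its method are hypothetical helpers written for illustration, not part of Hadoop.

```java
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper, not part of Hadoop: counts thread entries in a
// jstack-style dump whose stack mentions a given frame.
public class ThreadDumpCounter {

    /**
     * Splits the dump into thread entries (assumed to be separated by blank
     * lines) and counts the entries that contain the given frame text.
     */
    public static long countThreadsWithFrame(String dump, String frame) {
        return Arrays.stream(dump.split("\\R\\s*\\R"))
                .filter(entry -> entry.contains(frame))
                .count();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java ThreadDumpCounter <jstack-dump-file>");
            return;
        }
        String dump = new String(
                Files.readAllBytes(Paths.get(args[0])), StandardCharsets.UTF_8);
        // Assumption: LeaseChecker threads have a frame containing this text.
        long count = countThreadsWithFrame(dump, "DFSClient$LeaseChecker");
        System.out.println("LeaseChecker threads: " + count);
    }
}
```

Running this against dumps taken a few minutes apart should show whether the count keeps growing, which would be consistent with a leak rather than a temporary spike.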