Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-3762

Task tracker died due to OOM

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • None
    • 0.18.0
    • None
    • None
    • Reviewed

    Description

      When running about 100 moderate jobs on a small cluster (with 19 Task Trackers),
      the task trackers all died due to OOM.
      I got a chance to dump the jstack strace of a task tracker before it died.
      Its image size was close 4GB!
      I saw 1200+ threads of DFSClient.LeaseChecker.
      Clearly we have a severe resource leakage problem!

      Attachments

        1. HADOOP-3762.patch
          6 kB
          Doug Cutting
        2. HADOOP-3762.patch
          3 kB
          Doug Cutting
        3. 3762_20080717.patch
          7 kB
          Tsz-wo Sze
        4. HADOOP-3762.patch
          3 kB
          Doug Cutting
        5. HADOOP-3762.patch
          2 kB
          Doug Cutting
        6. 3762_20080715c.patch
          2 kB
          Tsz-wo Sze
        7. 3762_20080715b.patch
          2 kB
          Tsz-wo Sze
        8. 3762_20080715.patch
          1 kB
          Tsz-wo Sze
        9. TaskTrackerStackTrace.txt
          335 kB
          Runping Qi

        Activity

          People

            cutting Doug Cutting
            runping Runping Qi
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: