Hadoop Common / HADOOP-3762

Task tracker died due to OOM

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.18.0
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

      Description

      When running about 100 moderate jobs on a small cluster (with 19 Task Trackers),
      all of the task trackers died due to OOM.
      I managed to capture a jstack stack trace of one task tracker before it died.
      Its image size was close to 4 GB!
      The dump showed 1200+ DFSClient.LeaseChecker threads.
      Clearly we have a severe resource leak!

        Attachments

        1. TaskTrackerStackTrace.txt (335 kB, Runping Qi)
        2. HADOOP-3762.patch (2 kB, Doug Cutting)
        3. HADOOP-3762.patch (3 kB, Doug Cutting)
        4. HADOOP-3762.patch (3 kB, Doug Cutting)
        5. HADOOP-3762.patch (6 kB, Doug Cutting)
        6. 3762_20080717.patch (7 kB, Tsz-wo Sze)
        7. 3762_20080715c.patch (2 kB, Tsz-wo Sze)
        8. 3762_20080715b.patch (2 kB, Tsz-wo Sze)
        9. 3762_20080715.patch (1 kB, Tsz-wo Sze)

            People

            • Assignee: Doug Cutting
            • Reporter: Runping Qi
            • Votes: 0
            • Watchers: 4
