  1. Hadoop Map/Reduce
  2. MAPREDUCE-73

Deadlock in JobTracker initJobs

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Duplicate
    • Environment: Hadoop 0.19.1 in a test environment with 2 Task Trackers

    Description

      Found one Java-level deadlock:
      =============================
      "SocketListener0-26":
      waiting to lock monitor 0x08ed5ce4 (object 0x567924c0, a org.apache.hadoop.mapred.JobTracker),
      which is held by "IPC Server handler 1 on 9001"
      "IPC Server handler 1 on 9001":
      waiting to lock monitor 0x08f7da88 (object 0x5744f5b8, a org.apache.hadoop.mapred.JobInProgress),
      which is held by "initJobs"
      "initJobs":
      waiting to lock monitor 0x08ed5ce4 (object 0x567924c0, a org.apache.hadoop.mapred.JobTracker),
      which is held by "IPC Server handler 1 on 9001"
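
Read as a cycle, the summary above says that "initJobs" holds the JobInProgress monitor and wants the JobTracker monitor, while "IPC Server handler 1 on 9001" holds the JobTracker monitor and wants the JobInProgress monitor. One common way to rule out such a cycle is to make every code path take the two monitors in the same order. The sketch below illustrates that idea only; the class, lock, and method names are hypothetical stand-ins rather than Hadoop code, and it is not necessarily the fix applied in the issue this one duplicates.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class LockOrderSketch {
    // Hypothetical stand-ins for the two monitors named in the report.
    private static final Object TRACKER_LOCK = new Object(); // plays the JobTracker monitor
    private static final Object JOB_LOCK = new Object();     // plays the JobInProgress monitor
    private static int updates = 0;                          // guarded by both locks

    // Heartbeat-like path: tracker monitor first, then job monitor (same order as in the trace).
    static void heartbeatPath() {
        synchronized (TRACKER_LOCK) {
            synchronized (JOB_LOCK) {
                updates++;
            }
        }
    }

    // Terminate-like path. In the trace this path held the job monitor and then
    // asked for the tracker monitor; here it also takes the tracker monitor first,
    // so the two paths can no longer wait on each other in a cycle.
    static void terminatePath() {
        synchronized (TRACKER_LOCK) {
            synchronized (JOB_LOCK) {
                updates++;
            }
        }
    }

    // Runs both paths concurrently; returns true if both threads finish in time
    // and every update was applied.
    static boolean runBoth(int iterations) {
        synchronized (TRACKER_LOCK) {
            synchronized (JOB_LOCK) {
                updates = 0;
            }
        }
        CountDownLatch done = new CountDownLatch(2);
        Thread ipc = new Thread(() -> {
            for (int i = 0; i < iterations; i++) heartbeatPath();
            done.countDown();
        }, "ipc-handler-sketch");
        Thread init = new Thread(() -> {
            for (int i = 0; i < iterations; i++) terminatePath();
            done.countDown();
        }, "init-jobs-sketch");
        ipc.setDaemon(true);
        init.setDaemon(true);
        ipc.start();
        init.start();
        try {
            boolean finished = done.await(10, TimeUnit.SECONDS);
            return finished && updates == 2 * iterations;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("both paths finished: " + runBoth(10000));
    }
}
```

Swapping the two `synchronized` blocks in `terminatePath` would restore the inverted order seen in the trace and make a hang possible, which is why the sketch keeps a single order in both methods.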

      Java stack information for the threads listed above:
      ===================================================
      "SocketListener0-26":
        at org.apache.hadoop.mapred.JobTracker.getClusterStatus(JobTracker.java:2313)
        - waiting to lock <0x567924c0> (a org.apache.hadoop.mapred.JobTracker)
        at org.apache.hadoop.mapred.jobtracker_jsp._jspService(jobtracker_jsp.java:104)
        at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:94)
        at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
        at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
        at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
        at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
        at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
        at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
        at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
        at org.mortbay.http.HttpServer.service(HttpServer.java:954)
        at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
        at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
        at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
        at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
        at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
        at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)
      "IPC Server handler 1 on 9001":
        at org.apache.hadoop.mapred.JobInProgress.obtainTaskCleanupTask(JobInProgress.java:935)
        - waiting to lock <0x5744f5b8> (a org.apache.hadoop.mapred.JobInProgress)
        at org.apache.hadoop.mapred.JobTracker.getSetupAndCleanupTasks(JobTracker.java:2167)
        - locked <0x56795708> (a java.util.TreeMap)
        - locked <0x567924c0> (a org.apache.hadoop.mapred.JobTracker)
        at org.apache.hadoop.mapred.JobTracker.heartbeat(JobTracker.java:1902)
        - locked <0x567924c0> (a org.apache.hadoop.mapred.JobTracker)
        at sun.reflect.GeneratedMethodAccessor3278.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:481)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:894)
      "initJobs":
        at org.apache.hadoop.mapred.JobTracker.finalizeJob(JobTracker.java:1539)
        - waiting to lock <0x567924c0> (a org.apache.hadoop.mapred.JobTracker)
        at org.apache.hadoop.mapred.JobInProgress.garbageCollect(JobInProgress.java:2320)
        - locked <0x5744f5b8> (a org.apache.hadoop.mapred.JobInProgress)
        at org.apache.hadoop.mapred.JobInProgress.terminateJob(JobInProgress.java:2004)
        - locked <0x5744f5b8> (a org.apache.hadoop.mapred.JobInProgress)
        at org.apache.hadoop.mapred.JobInProgress.initTasks(JobInProgress.java:472)
        - locked <0x575b7ec8> (a org.apache.hadoop.mapred.JobInProgress$JobInitKillStatus)
        - locked <0x5744f5b8> (a org.apache.hadoop.mapred.JobInProgress)
        at org.apache.hadoop.mapred.EagerTaskInitializationListener$JobInitThread.run(EagerTaskInitializationListener.java:55)

      Found 1 deadlock.
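
A report like the one above comes from a thread dump, but the same monitor-cycle check can also be run from inside a JVM through the standard `java.lang.management.ThreadMXBean` API. The following is a minimal sketch under that assumption; `DeadlockProbe` and `describeDeadlocks` are hypothetical names for illustration and are not part of Hadoop.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;

public class DeadlockProbe {
    // Returns one line per deadlocked thread, or an empty string if none are found.
    static String describeDeadlocks() {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        // findDeadlockedThreads() returns null when no deadlock is detected.
        long[] ids = mx.findDeadlockedThreads();
        if (ids == null) {
            return "";
        }
        StringBuilder sb = new StringBuilder();
        // Request locked monitors and synchronizers so lock details are populated.
        for (ThreadInfo info : mx.getThreadInfo(ids, true, true)) {
            if (info == null) {
                continue; // the thread exited between the two calls
            }
            sb.append('"').append(info.getThreadName()).append('"')
              .append(" waiting on ").append(info.getLockName())
              .append(" held by \"").append(info.getLockOwnerName()).append("\"\n");
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String report = describeDeadlocks();
        System.out.println(report.isEmpty() ? "no deadlock detected" : report);
    }
}
```

Calling such a probe periodically from a watchdog thread is one way a long-running daemon could log a cycle like this one without waiting for someone to take a manual thread dump.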

          People

            Assignee: Unassigned
            Reporter: Dave Latham (davelatham)
            Votes: 0
            Watchers: 7