Apache Tez / TEZ-1494

DAG hangs waiting for ShuffleManager.getNextInput()

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • None
    • 0.5.1
    • None
    • Reviewed

    Description

      Attaching the DAG and the stack trace of the hung process.

      Thread 30071: (state = BLOCKED)

      • sun.misc.Unsafe.park(boolean, long) @bci=0 (Interpreted frame)
      • java.util.concurrent.locks.LockSupport.park(java.lang.Object) @bci=14, line=186 (Interpreted frame)
      • java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await() @bci=42, line=2043 (Interpreted frame)
      • java.util.concurrent.LinkedBlockingQueue.take() @bci=29, line=442 (Interpreted frame)
      • org.apache.tez.runtime.library.shuffle.common.impl.ShuffleManager.getNextInput() @bci=67, line=610 (Interpreted frame)
      • org.apache.tez.runtime.library.common.readers.UnorderedKVReader.moveToNextInput() @bci=26, line=176 (Interpreted frame)
      • org.apache.tez.runtime.library.common.readers.UnorderedKVReader.next() @bci=30, line=117 (Interpreted frame)
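
      The trace above shows the reader thread parked inside an untimed LinkedBlockingQueue.take(). As a rough illustration of that pattern (not Tez code, and not the change made in the attached patches), the sketch below shows how a consumer using take() waits indefinitely when nothing is ever enqueued, and how a timed poll() combined with a failure flag lets the consumer notice that its producer is gone. The class BlockingTakeSketch, the method nextInput, and the producerFailed flag are hypothetical names introduced only for this example; the issue text does not state the actual root cause.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical illustration only; none of these names come from Tez.
public class BlockingTakeSketch {

    // Waits for the next completed input, but re-checks a failure flag after
    // every poll timeout so the caller is not parked forever if producers die.
    static String nextInput(BlockingQueue<String> completed,
                            AtomicBoolean producerFailed,
                            long pollMillis) throws InterruptedException {
        while (true) {
            String input = completed.poll(pollMillis, TimeUnit.MILLISECONDS);
            if (input != null) {
                return input;
            }
            if (producerFailed.get()) {
                throw new IllegalStateException(
                        "producer failed before delivering input");
            }
        }
    }

    public static void main(String[] args) throws Exception {
        BlockingQueue<String> completed = new LinkedBlockingQueue<>();
        AtomicBoolean producerFailed = new AtomicBoolean(false);

        // Consumer using an untimed take(): with no producer it stays parked,
        // similar to the thread in the stack trace.
        Thread consumer = new Thread(() -> {
            try {
                completed.take();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        consumer.setDaemon(true);
        consumer.start();
        Thread.sleep(200);
        System.out.println("take() consumer state: " + consumer.getState());
        consumer.interrupt();

        // Consumer using a timed poll(): it observes the failure flag and exits.
        producerFailed.set(true);
        try {
            nextInput(completed, producerFailed, 50);
        } catch (IllegalStateException e) {
            System.out.println("timed poll consumer: " + e.getMessage());
        }
    }
}
```

      The timed poll trades a small amount of wake-up overhead for the ability to surface a producer failure to the caller; another common option is to have the failing producer enqueue a sentinel element so that take() itself returns.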

      Attachments

        1. TEZ-1494.5.patch
          29 kB
          Rajesh Balamohan
        2. TEZ-1494.4.patch
          29 kB
          Rajesh Balamohan
        3. TEZ-1494.3.patch
          26 kB
          Rajesh Balamohan
        4. TEZ-1494.2.patch
          11 kB
          Rajesh Balamohan
        5. TEZ-1494.1.patch
          7 kB
          Rajesh Balamohan
        6. TEZ-1494-DAG.dot
          6 kB
          Rajesh Balamohan

          People

            Assignee: Rajesh Balamohan
            Reporter: Rajesh Balamohan
            Votes: 0
            Watchers: 5
