Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-2426

Ensure the eventRouter thread completes before switching to a new task and thread safety fixes in IPOContexts.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.7.0
    • 0.7.0
    • None
    • None

    Description

      Sequence of events
      1) Task A starts in a container
      2) Task A complete event comes to AM
      3) Task B starts in the same container
      4) Task A's input calls some method on its context. Crashes with NPE
      5) The crash sends an input failed event for Task A to the AM
      6) Task A state machine crashes saying cannot handle failed after success

      In some cases, it could be that status update event is also sent after completion, though not sure if its related to the failed event being sent.

      Attachments

        1. TEZ-2426-3.patch
          1 kB
          Daniel Dai
        2. TEZ-2426.addendum.txt
          2 kB
          Siddharth Seth
        3. TEZ-2426.2.txt
          22 kB
          Siddharth Seth
        4. TEZ-2426.1.txt
          21 kB
          Siddharth Seth
        5. container.log
          319 kB
          Bikas Saha
        6. am.log
          9 kB
          Bikas Saha

        Issue Links

          Activity

            People

              sseth Siddharth Seth
              bikassaha Bikas Saha
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: