Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-2311

AM can hang if kill received while recovering from previous attempt

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.6.0
    • Fix Version/s: 0.6.2, 0.8.0-alpha, 0.7.1
    • Component/s: None
    • Labels:

      Description

      We saw an instance of a Tez job hanging despite receiving multiple kill requests from clients. The AM was recovering from a prior attempt when the first kill request arrived.

        Attachments

        1. TEZ-2311-1.patch
          18 kB
          Jeff Zhang
        2. TEZ-2311-2.patch
          18 kB
          Jeff Zhang
        3. TEZ-2311-3.patch
          38 kB
          Jeff Zhang
        4. TEZ-2311-4.patch
          46 kB
          Jeff Zhang
        5. TEZ-2311-5.patch
          47 kB
          Jeff Zhang
        6. TEZ-2311-6.patch
          46 kB
          Jeff Zhang
        7. TEZ-2311-7.patch
          45 kB
          Jeff Zhang
        8. TEZ-2311-8.patch
          45 kB
          Jeff Zhang

          Issue Links

            Activity

              People

              • Assignee:
                zjffdu Jeff Zhang
                Reporter:
                jlowe Jason Darrell Lowe
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: