Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-2311

AM can hang if kill received while recovering from previous attempt

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.6.0
    • 0.6.2, 0.8.0-alpha, 0.7.1
    • None

    Description

      We saw an instance of a Tez job hanging despite receiving multiple kill requests from clients. The AM was recovering from a prior attempt when the first kill request arrived.

      Attachments

        1. TEZ-2311-1.patch
          18 kB
          Jeff Zhang
        2. TEZ-2311-2.patch
          18 kB
          Jeff Zhang
        3. TEZ-2311-3.patch
          38 kB
          Jeff Zhang
        4. TEZ-2311-4.patch
          46 kB
          Jeff Zhang
        5. TEZ-2311-5.patch
          47 kB
          Jeff Zhang
        6. TEZ-2311-6.patch
          46 kB
          Jeff Zhang
        7. TEZ-2311-7.patch
          45 kB
          Jeff Zhang
        8. TEZ-2311-8.patch
          45 kB
          Jeff Zhang

        Issue Links

          Activity

            People

              zjffdu Jeff Zhang
              jlowe Jason Darrell Lowe
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: