Apache Tez / TEZ-2311

AM can hang if kill received while recovering from previous attempt


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.6.0
    • Fix Version/s: 0.6.2, 0.8.0-alpha, 0.7.1
    • None

    Description

      We saw an instance of a Tez job hanging despite receiving multiple kill requests from clients. The AM was recovering from a prior attempt when the first kill request arrived.

      Attachments

        1. TEZ-2311-1.patch (18 kB, Jeff Zhang)
        2. TEZ-2311-2.patch (18 kB, Jeff Zhang)
        3. TEZ-2311-3.patch (38 kB, Jeff Zhang)
        4. TEZ-2311-4.patch (46 kB, Jeff Zhang)
        5. TEZ-2311-5.patch (47 kB, Jeff Zhang)
        6. TEZ-2311-6.patch (46 kB, Jeff Zhang)
        7. TEZ-2311-7.patch (45 kB, Jeff Zhang)
        8. TEZ-2311-8.patch (45 kB, Jeff Zhang)


          People

            Assignee: Jeff Zhang (zjffdu)
            Reporter: Jason Darrell Lowe (jlowe)
            Votes: 0
            Watchers: 4

