Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-3335

DAG client thinks app is still running when app status is null

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.7.1
    • 0.7.2, 0.9.0, 0.8.5
    • None
    • None

    Description

      When an RM restarts without recovering apps (i.e.: either work-preserving is not enabled or state store was removed) and the YARN application history is enabled then YarnClient can return an application report with the app status as null. The RM doesn't know about the application, so the client redirects to the AHS. The AHS knows the app started at some point but will never received a finished event, hence the null app status.

      The DAG client fails to detect this scenario and believes the app is still running, so for example Hive clients will continue to hammer for status on an app that doesn't exist.

      Attachments

        1. TEZ-3335.001.patch
          5 kB
          Jason Darrell Lowe

        Issue Links

          Activity

            People

              jlowe Jason Darrell Lowe
              jlowe Jason Darrell Lowe
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: