Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-2010

Handle app-recovery failures gracefully

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • 2.3.0
    • 2.6.0
    • resourcemanager
    • None
    • Reviewed

    Description

      Sometimes, the RM fails to recover an application. It could be because of turning security on, token expiry, or issues connecting to HDFS etc. The causes could be classified into (1) transient, (2) specific to one application, and (3) permanent and apply to multiple (all) applications. Today, the RM fails to transition to Active and ends up in STOPPED state and can never be transitioned to Active again.

      The initial stacktrace reported is at https://issues.apache.org/jira/secure/attachment/12676476/issue-stacktrace.rtf

      Attachments

        1. YARN-2010.patch
          1 kB
          Rohith Sharma K S
        2. YARN-2010.1.patch
          6 kB
          Rohith Sharma K S
        3. yarn-2010-2.patch
          9 kB
          Karthik Kambatla
        4. yarn-2010-3.patch
          9 kB
          Karthik Kambatla
        5. yarn-2010-3.patch
          9 kB
          Karthik Kambatla
        6. yarn-2010-4.patch
          15 kB
          Karthik Kambatla
        7. issue-stacktrace.rtf
          3 kB
          Karthik Kambatla
        8. yarn-2010-5.patch
          17 kB
          Karthik Kambatla
        9. yarn-2010-6.patch
          17 kB
          Karthik Kambatla
        10. yarn-2010-7.patch
          21 kB
          Karthik Kambatla
        11. yarn-2010-8.patch
          16 kB
          Karthik Kambatla
        12. yarn-2010-9.patch
          22 kB
          Karthik Kambatla
        13. yarn-2010-10.patch
          37 kB
          Karthik Kambatla
        14. yarn-2010-11.patch
          20 kB
          Jian He
        15. YARN-2010.12.patch
          23 kB
          Jian He
        16. YARN-2010.12.patch
          25 kB
          Jian He
        17. yarn-2010-13.patch
          26 kB
          Karthik Kambatla
        18. YARN-2010.14.patch
          26 kB
          Jian He
        19. YARN-2010.15.patch
          26 kB
          Jian He
        20. YARN-2010.16.patch
          27 kB
          Jian He

        Issue Links

          Activity

            People

              kasha Karthik Kambatla
              bcwalrus bc Wong
              Votes:
              0 Vote for this issue
              Watchers:
              14 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: