YARN-2010 (Hadoop YARN): Handle app-recovery failures gracefully

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 2.3.0
    • Fix Version/s: 2.6.0
    • Component/s: resourcemanager
    • Labels: None
    • Hadoop Flags: Reviewed

      Description

      Sometimes the RM fails to recover an application, for example after security is turned on, when a token has expired, or when the RM cannot connect to HDFS. The causes fall into three classes: (1) transient, (2) specific to one application, and (3) permanent and applicable to multiple (possibly all) applications. Today, such a failure prevents the RM from transitioning to Active: it ends up in the STOPPED state and can never be transitioned to Active again.

      The initially reported stack trace is at https://issues.apache.org/jira/secure/attachment/12676476/issue-stacktrace.rtf
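
      The three failure classes above suggest different reactions during recovery. The following is a minimal, self-contained sketch of that idea and is not taken from the attached patches or from the ResourceManager code. Every name in it (RecoverySketch, FailureKind, AppRecoverer, recoverAll, maxAttempts) is hypothetical. It assumes transient failures are retried a bounded number of times, application-specific failures are recorded and skipped, and permanent failures stop the recovery pass and are reported to the caller rather than leaving the process in an unrecoverable state.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch only: none of these types exist in Hadoop YARN.
final class RecoverySketch {

    // The three classes of recovery failure named in the description.
    enum FailureKind { TRANSIENT, APP_SPECIFIC, PERMANENT }

    // Hypothetical exception carrying the failure class.
    static class RecoveryException extends Exception {
        final FailureKind kind;
        RecoveryException(FailureKind kind, String message) {
            super(message);
            this.kind = kind;
        }
    }

    // Hypothetical callback that recovers a single application.
    interface AppRecoverer {
        void recover(String appId) throws RecoveryException;
    }

    // Outcome of one recovery pass.
    static class Result {
        final List<String> recovered = new ArrayList<>();
        final List<String> failed = new ArrayList<>();
        boolean aborted = false; // true if a permanent failure stopped the pass
    }

    static Result recoverAll(List<String> appIds, AppRecoverer recoverer, int maxAttempts) {
        Result result = new Result();
        for (String appId : appIds) {
            for (int attempt = 1; ; attempt++) {
                try {
                    recoverer.recover(appId);
                    result.recovered.add(appId);
                    break;
                } catch (RecoveryException e) {
                    if (e.kind == FailureKind.PERMANENT) {
                        // Affects multiple (all) apps: stop and let the caller decide.
                        result.aborted = true;
                        return result;
                    }
                    if (e.kind == FailureKind.TRANSIENT && attempt < maxAttempts) {
                        continue; // retry the same application
                    }
                    // App-specific failure, or transient retries exhausted:
                    // record the app as failed and move on to the next one.
                    result.failed.add(appId);
                    break;
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up application IDs; "app_bad" always fails with an app-specific error.
        AppRecoverer recoverer = appId -> {
            if (appId.equals("app_bad")) {
                throw new RecoveryException(FailureKind.APP_SPECIFIC, "expired token");
            }
        };
        Result r = recoverAll(Arrays.asList("app_ok", "app_bad"), recoverer, 3);
        System.out.println("recovered=" + r.recovered + " failed=" + r.failed
                + " aborted=" + r.aborted);
    }
}
```

      The point of the sketch is that a single bad application need not block the rest, and that a permanent failure is surfaced as a result the caller can act on (for example, by staying in standby and retrying later) instead of as an uncaught exception.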

        Attachments

        1. issue-stacktrace.rtf
          3 kB
          Karthik Kambatla
        2. YARN-2010.1.patch
          6 kB
          Rohith Sharma K S
        3. YARN-2010.12.patch
          25 kB
          Jian He
        4. YARN-2010.12.patch
          23 kB
          Jian He
        5. YARN-2010.14.patch
          26 kB
          Jian He
        6. YARN-2010.15.patch
          26 kB
          Jian He
        7. YARN-2010.16.patch
          27 kB
          Jian He
        8. YARN-2010.patch
          1 kB
          Rohith Sharma K S
        9. yarn-2010-10.patch
          37 kB
          Karthik Kambatla
        10. yarn-2010-11.patch
          20 kB
          Jian He
        11. yarn-2010-13.patch
          26 kB
          Karthik Kambatla
        12. yarn-2010-2.patch
          9 kB
          Karthik Kambatla
        13. yarn-2010-3.patch
          9 kB
          Karthik Kambatla
        14. yarn-2010-3.patch
          9 kB
          Karthik Kambatla
        15. yarn-2010-4.patch
          15 kB
          Karthik Kambatla
        16. yarn-2010-5.patch
          17 kB
          Karthik Kambatla
        17. yarn-2010-6.patch
          17 kB
          Karthik Kambatla
        18. yarn-2010-7.patch
          21 kB
          Karthik Kambatla
        19. yarn-2010-8.patch
          16 kB
          Karthik Kambatla
        20. yarn-2010-9.patch
          22 kB
          Karthik Kambatla

              People

              • Assignee: Karthik Kambatla (kasha)
              • Reporter: bc Wong (bcwalrus)
              • Votes: 0
              • Watchers: 15
