Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-2010

Handle app-recovery failures gracefully

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 2.3.0
    • Fix Version/s: 2.6.0
    • Component/s: resourcemanager
    • Labels:
      None
    • Target Version/s:
    • Hadoop Flags:
      Reviewed

      Description

      Sometimes, the RM fails to recover an application. It could be because of turning security on, token expiry, or issues connecting to HDFS etc. The causes could be classified into (1) transient, (2) specific to one application, and (3) permanent and apply to multiple (all) applications. Today, the RM fails to transition to Active and ends up in STOPPED state and can never be transitioned to Active again.

      The initial stacktrace reported is at https://issues.apache.org/jira/secure/attachment/12676476/issue-stacktrace.rtf

        Attachments

        1. yarn-2010-9.patch
          22 kB
          Karthik Kambatla
        2. yarn-2010-8.patch
          16 kB
          Karthik Kambatla
        3. yarn-2010-7.patch
          21 kB
          Karthik Kambatla
        4. yarn-2010-6.patch
          17 kB
          Karthik Kambatla
        5. yarn-2010-5.patch
          17 kB
          Karthik Kambatla
        6. yarn-2010-4.patch
          15 kB
          Karthik Kambatla
        7. yarn-2010-3.patch
          9 kB
          Karthik Kambatla
        8. yarn-2010-3.patch
          9 kB
          Karthik Kambatla
        9. yarn-2010-2.patch
          9 kB
          Karthik Kambatla
        10. yarn-2010-13.patch
          26 kB
          Karthik Kambatla
        11. yarn-2010-11.patch
          20 kB
          Jian He
        12. yarn-2010-10.patch
          37 kB
          Karthik Kambatla
        13. YARN-2010.patch
          1 kB
          Rohith Sharma K S
        14. YARN-2010.16.patch
          27 kB
          Jian He
        15. YARN-2010.15.patch
          26 kB
          Jian He
        16. YARN-2010.14.patch
          26 kB
          Jian He
        17. YARN-2010.12.patch
          23 kB
          Jian He
        18. YARN-2010.12.patch
          25 kB
          Jian He
        19. YARN-2010.1.patch
          6 kB
          Rohith Sharma K S
        20. issue-stacktrace.rtf
          3 kB
          Karthik Kambatla

          Issue Links

            Activity

              People

              • Assignee:
                kasha Karthik Kambatla
                Reporter:
                bcwalrus bc Wong
              • Votes:
                0 Vote for this issue
                Watchers:
                15 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: