
    Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 2.3.0
    • Fix Version/s: 2.8.0, 3.0.0-alpha1
    • Component/s: resourcemanager
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Currently, work-preserving RM restart recovers unmanaged AMs, but it has two shortcomings: all running containers are killed, and completed unmanaged AMs are also recovered because we do not record the final state of unmanaged AMs in the RM StateStore. This JIRA proposes to address both shortcomings so that work-preserving recovery of unmanaged AMs works exactly as it does for managed AMs.
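
      The second shortcoming (completed unmanaged AMs being recovered) can be illustrated with a minimal sketch. It is not taken from the attached patches and does not use real YARN classes: `InMemoryStateStore`, `AttemptRecord`, `FinalState`, and all method names are hypothetical stand-ins, assumed only for illustration. The idea shown is simply that once a final state is persisted for an attempt, recovery can skip it. Container preservation is not modeled here.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: none of these types are real YARN classes.
class UnmanagedAmRecoverySketch {

    // NONE means no final state has been persisted yet (attempt still live).
    enum FinalState { NONE, FINISHED, FAILED, KILLED }

    // One persisted record per application attempt.
    static final class AttemptRecord {
        final String attemptId;
        final boolean unmanaged;
        FinalState finalState = FinalState.NONE;

        AttemptRecord(String attemptId, boolean unmanaged) {
            this.attemptId = attemptId;
            this.unmanaged = unmanaged;
        }
    }

    // Toy stand-in for the RM StateStore, kept in memory for the sketch.
    static final class InMemoryStateStore {
        private final Map<String, AttemptRecord> records = new LinkedHashMap<>();

        void storeNewAttempt(String attemptId, boolean unmanaged) {
            records.put(attemptId, new AttemptRecord(attemptId, unmanaged));
        }

        // Persist the final state, for unmanaged attempts as well as managed ones.
        void recordFinalState(String attemptId, FinalState state) {
            AttemptRecord record = records.get(attemptId);
            if (record == null) {
                throw new IllegalArgumentException("Unknown attempt: " + attemptId);
            }
            record.finalState = state;
        }

        // On restart, only attempts without a recorded final state are recovered.
        List<String> attemptsToRecover() {
            List<String> live = new ArrayList<>();
            for (AttemptRecord record : records.values()) {
                if (record.finalState == FinalState.NONE) {
                    live.add(record.attemptId);
                }
            }
            return live;
        }
    }
}
```

      In this sketch, an unmanaged attempt that finished before the restart is filtered out by `attemptsToRecover()`, while one that is still running is returned for recovery.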

        Attachments

        1. YARN-1815-v6.patch
          13 kB
          Subramaniam Krishnan
        2. YARN-1815-v5.patch
          11 kB
          Subramaniam Krishnan
        3. YARN-1815-v4.patch
          10 kB
          Subramaniam Krishnan
        4. YARN-1815-v3.patch
          9 kB
          Subramaniam Krishnan
        5. yarn-1815-2.patch
          3 kB
          Karthik Kambatla
        6. yarn-1815-2.patch
          4 kB
          Karthik Kambatla
        7. yarn-1815-1.patch
          3 kB
          Karthik Kambatla
        8. Unmanaged AM recovery.png
          149 kB
          Karthik Kambatla



            People

            • Assignee:
              subru Subramaniam Krishnan
            • Reporter:
              kasha Karthik Kambatla

              Dates

              • Created:
                Updated:
                Resolved:
