  1. Hadoop YARN
  2. YARN-3387

Previous AM's container-complete message is not passed to the current AM if the AM restarted and the RM changed


Details

    • Reviewed

    Description

      Suppose work-preserving AM restart and RM HA are both enabled.
      In the RM, a container-complete message is passed to appAttempt.justFinishedContainers. Normally, all attempts of one app share the same justFinishedContainers, but once the RM has changed, every attempt has its own justFinishedContainers. As a result, in the following sequence the container-complete message cannot be passed to the AM:
      1. The AM restarts.
      2. The RM changes.
      3. A container launched by the first AM completes.
      The container-complete message is passed to appAttempt1, not appAttempt2, but the AM pulls finished containers from appAttempt2 (currentAppAttempt).
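
      The sequence above can be illustrated with a small toy model. This is only a sketch under stated assumptions: the class FinishedContainerDemo, its nested Attempt class, and the method pullAfterCompletion are hypothetical names made up for illustration and are not the ResourceManager classes. Only the field name justFinishedContainers comes from the description.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model only: hypothetical classes, not the actual YARN ResourceManager code.
class FinishedContainerDemo {

    // Stand-in for an app attempt holding a justFinishedContainers list.
    static final class Attempt {
        final int id;
        final List<String> justFinishedContainers;

        Attempt(int id, List<String> justFinishedContainers) {
            this.id = id;
            this.justFinishedContainers = justFinishedContainers;
        }
    }

    // Replays the three steps and returns what the current AM (attempt 2) pulls.
    static List<String> pullAfterCompletion(boolean rmChanged) {
        List<String> shared = new ArrayList<>();

        // Steps 1 and 2: the AM has restarted, so two attempts exist.
        // If the RM changed, each attempt gets its own list;
        // otherwise both attempts reference the same shared list.
        Attempt attempt1 = new Attempt(1, rmChanged ? new ArrayList<>() : shared);
        Attempt attempt2 = new Attempt(2, rmChanged ? new ArrayList<>() : shared);

        // Step 3: a container launched by the first AM completes,
        // and the message is recorded on attempt 1.
        attempt1.justFinishedContainers.add("container_launched_by_am1");

        // The AM pulls finished containers from the current attempt (attempt 2).
        return new ArrayList<>(attempt2.justFinishedContainers);
    }

    public static void main(String[] args) {
        System.out.println("RM unchanged: " + pullAfterCompletion(false));
        System.out.println("RM changed:   " + pullAfterCompletion(true));
    }
}
```

      In this model, the pull returns the completed container when the list is shared and returns an empty list when each attempt has its own list, which mirrors the lost message described above.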

      Attachments

        1. YARN-3387.001.patch
          2 kB
          sandflee
        2. YARN-3387.002.patch
          7 kB
          sandflee

          People

            sandflee
            Votes: 0
            Watchers: 12

