Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-522 [Umbrella] Better reporting for crashed/Killed AMs and Containers
  3. YARN-2013

The diagnostics is always the ExitCodeException stack when the container crashes

VotersStop watchingWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 2.6.0
    • nodemanager
    • None

    Description

      When a container crashes, ExitCodeException will be thrown from Shell. Default/LinuxContainerExecutor captures the exception, put the exception stack into the diagnostic. Therefore, the exception stack is always the same.

              String diagnostics = "Exception from container-launch: \n"
                  + StringUtils.stringifyException(e) + "\n" + shExec.getOutput();
              container.handle(new ContainerDiagnosticsUpdateEvent(containerId,
                  diagnostics));
      

      In addition, it seems that the exception always has a empty message as there's no message from stderr. Hence the diagnostics is not of much use for users to analyze the reason of container crash.

      Attachments

        1. YARN-2013.5.patch
          15 kB
          Tsuyoshi Ozawa
        2. YARN-2013.4.patch
          14 kB
          Tsuyoshi Ozawa
        3. YARN-2013.3-2.patch
          4 kB
          Tsuyoshi Ozawa
        4. YARN-2013.3.patch
          4 kB
          Tsuyoshi Ozawa
        5. YARN-2013.2.patch
          4 kB
          Tsuyoshi Ozawa
        6. YARN-2013.1.patch
          4 kB
          Tsuyoshi Ozawa

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            ozawa Tsuyoshi Ozawa
            zjshen Zhijie Shen
            Votes:
            0 Vote for this issue
            Watchers:
            8 Stop watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment