Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-522 [Umbrella] Better reporting for crashed/Killed AMs and Containers
  3. YARN-2013

The diagnostics is always the ExitCodeException stack when the container crashes

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 2.6.0
    • nodemanager
    • None

    Description

      When a container crashes, ExitCodeException will be thrown from Shell. Default/LinuxContainerExecutor captures the exception, put the exception stack into the diagnostic. Therefore, the exception stack is always the same.

              String diagnostics = "Exception from container-launch: \n"
                  + StringUtils.stringifyException(e) + "\n" + shExec.getOutput();
              container.handle(new ContainerDiagnosticsUpdateEvent(containerId,
                  diagnostics));
      

      In addition, it seems that the exception always has a empty message as there's no message from stderr. Hence the diagnostics is not of much use for users to analyze the reason of container crash.

      Attachments

        1. YARN-2013.1.patch
          4 kB
          Tsuyoshi Ozawa
        2. YARN-2013.2.patch
          4 kB
          Tsuyoshi Ozawa
        3. YARN-2013.3.patch
          4 kB
          Tsuyoshi Ozawa
        4. YARN-2013.3-2.patch
          4 kB
          Tsuyoshi Ozawa
        5. YARN-2013.4.patch
          14 kB
          Tsuyoshi Ozawa
        6. YARN-2013.5.patch
          15 kB
          Tsuyoshi Ozawa

        Issue Links

          Activity

            People

              ozawa Tsuyoshi Ozawa
              zjshen Zhijie Shen
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: