  1. Hadoop YARN
  2. YARN-522 [Umbrella] Better reporting for crashed/Killed AMs and Containers
  3. YARN-2013

The diagnostics is always the ExitCodeException stack when the container crashes

    Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.6.0
    • Component/s: nodemanager
    • Labels:
      None

      Description

      When a container crashes, Shell throws an ExitCodeException. DefaultContainerExecutor and LinuxContainerExecutor catch the exception and put its stack trace into the diagnostics. As a result, the stack trace in the diagnostics is always the same.

              String diagnostics = "Exception from container-launch: \n"
                  + StringUtils.stringifyException(e) + "\n" + shExec.getOutput();
              container.handle(new ContainerDiagnosticsUpdateEvent(containerId,
                  diagnostics));
      

      In addition, the exception always seems to have an empty message, because no message is captured from stderr. The diagnostics are therefore of little use to users trying to work out why a container crashed.
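
      One possible direction, shown only as a minimal sketch and not as the change made by the attached patches: build the diagnostics from the exit code and any captured output rather than from the stack trace, which is identical for every crash. The class `DiagnosticsSketch`, the method `buildDiagnostics`, and the nested `ExitCodeException` are hypothetical stand-ins written for this example; the nested exception only mimics the idea of an exception that carries an exit code and is not the Hadoop class.

```java
import java.io.IOException;

/** Hypothetical sketch for illustration; not the Hadoop implementation. */
public class DiagnosticsSketch {

    /** Stand-in exception (assumption): carries an exit code plus a message taken from stderr. */
    static class ExitCodeException extends IOException {
        private final int exitCode;

        ExitCodeException(int exitCode, String message) {
            super(message);
            this.exitCode = exitCode;
        }

        int getExitCode() {
            return exitCode;
        }
    }

    /**
     * Builds a diagnostics string that varies with the failure:
     * the exit code is always reported, while the exception message
     * and the shell output are included only when they are non-empty.
     */
    static String buildDiagnostics(ExitCodeException e, String shellOutput) {
        StringBuilder sb = new StringBuilder("Exception from container-launch.\n");
        sb.append("Exit code: ").append(e.getExitCode()).append('\n');

        String message = e.getMessage();
        if (message != null && !message.trim().isEmpty()) {
            sb.append("Exception message: ").append(message.trim()).append('\n');
        }
        if (shellOutput != null && !shellOutput.trim().isEmpty()) {
            sb.append("Shell output: ").append(shellOutput.trim()).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // An empty message, as described in the issue; the exit code still distinguishes the crash.
        ExitCodeException crash = new ExitCodeException(137, "");
        System.out.print(buildDiagnostics(crash, "Killed"));
    }
}
```

      With this shape, two containers that fail with different exit codes produce different diagnostics even when stderr is empty, which is the property the original string lacks.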

        Attachments

        1. YARN-2013.1.patch
          4 kB
          Tsuyoshi Ozawa
        2. YARN-2013.2.patch
          4 kB
          Tsuyoshi Ozawa
        3. YARN-2013.3.patch
          4 kB
          Tsuyoshi Ozawa
        4. YARN-2013.3-2.patch
          4 kB
          Tsuyoshi Ozawa
        5. YARN-2013.4.patch
          14 kB
          Tsuyoshi Ozawa
        6. YARN-2013.5.patch
          15 kB
          Tsuyoshi Ozawa

              People

              • Assignee: Tsuyoshi Ozawa (ozawa)
              • Reporter: Zhijie Shen (zjshen)
              • Votes: 0
              • Watchers: 8
