Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-8274

Docker command error during container relaunch

    Details

    • Type: Task
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.2.0, 3.1.1
    • Component/s: None
    • Labels:
    • Target Version/s:

      Description

      I initiated container relaunch with a "sleep 60; exit 1" launch command and saw a "not a docker command" error on relaunch. Haven't figured out why this is happening, but it seems like it has been introduced recently to trunk/branch-3.1. cc Shane Kumpf Eric Badger

      org.apache.hadoop.yarn.server.nodemanager.containermanager.runtime.ContainerExecutionException: Relaunch container failed
              at org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.runtime.DockerLinuxContainerRuntime.relaunchContainer(DockerLinuxContainerRuntime.java:954)
              at org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.runtime.DelegatingLinuxContainerRuntime.relaunchContainer(DelegatingLinuxContainerRuntime.java:150)
              at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.handleLaunchForLaunchType(LinuxContainerExecutor.java:562)
              at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.relaunchContainer(LinuxContainerExecutor.java:486)
              at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.relaunchContainer(ContainerLaunch.java:504)
              at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerRelaunch.call(ContainerRelaunch.java:111)
              at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerRelaunch.call(ContainerRelaunch.java:47)
              at java.util.concurrent.FutureTask.run(FutureTask.java:266)
              at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
              at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
              at java.lang.Thread.run(Thread.java:748)
      2018-05-09 21:41:46,631 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: Exception from container-launch.
      2018-05-09 21:41:46,631 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: Container id: container_1525897486447_0003_01_000002
      2018-05-09 21:41:46,631 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: Exit code: 7
      2018-05-09 21:41:46,631 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: Exception message: Relaunch container failed
      2018-05-09 21:41:46,631 INFO org.apache.hadoop.yarn.server.nodemanager.ContainerExecutor: Shell error output: docker: 'container_1525897486447_0003_01_000002' is not a docker command.
      

        Attachments

        1. YARN-8274.001.patch
          2 kB
          Jason Lowe
        2. YARN-8274.002.patch
          4 kB
          Jason Lowe

          Issue Links

            Activity

              People

              • Assignee:
                jlowe Jason Lowe
                Reporter:
                billie.rinaldi Billie Rinaldi
              • Votes:
                0 Vote for this issue
                Watchers:
                7 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: