  1. Hadoop YARN
  2. YARN-8407

Container launch exception in AM log should be printed in ERROR level

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.2.0, 3.1.2
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      When a container launch fails because the Docker image is not available, the failure is logged at INFO level in the AM log.
      Container launch failures should be logged at ERROR level.

      Steps:
      Launch an httpd yarn-service application with an invalid Docker image.

       

      2018-06-07 01:51:32,966 [Component  dispatcher] INFO  instance.ComponentInstance - [COMPINSTANCE httpd-0 : container_e05_1528335963594_0001_01_000002]: container_e05_1528335963594_0001_01_000002 completed. Reinsert back to pending list and requested a new container.
      exitStatus=-1, diagnostics=[2018-06-07 01:51:02.363]Exception from container-launch.
      Container id: container_e05_1528335963594_0001_01_000002
      Exit code: 7
      Exception message: Launch container failed
      Shell error output: Unable to find image 'xxx/httpd:0.1' locally
      Trying to pull repository xxx/httpd ...
      /usr/bin/docker-current: Get https://xxx/v1/_ping: dial tcp: lookup xxx on yyy: no such host.
      See '/usr/bin/docker-current run --help'.

      Shell output: main : command provided 4
      main : run as user is hbase
      main : requested yarn user is hbase
      Creating script paths...
      Creating local dirs...
      Getting exit code file...
      Changing effective user to root...
      Wrote the exit code 7 to /grid/0/hadoop/yarn/local/nmPrivate/application_1528335963594_0001/container_e05_1528335963594_0001_01_000002/container_e05_1528335963594_0001_01_000002.pid.exitcode
      [2018-06-07 01:51:02.393]Diagnostic message from attempt :
      [2018-06-07 01:51:02.394]Container exited with a non-zero exit code 7. Last 4096 bytes of stderr.txt :
      [2018-06-07 01:51:32.428]Could not find nmPrivate/application_1528335963594_0001/container_e05_1528335963594_0001_01_000002//container_e05_1528335963594_0001_01_000002.pid in any of the directories
      2018-06-07 01:51:32,966 [Component  dispatcher] INFO  instance.ComponentInstance - [COMPINSTANCE httpd-0 : container_e05_1528335963594_0001_01_000002] Transitioned from STARTED to INIT on STOP event
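
      The requested behavior could be sketched roughly as follows. This is only an illustration of picking the log level from the container exit status, not the attached patch. The class `ContainerCompletionLogging` and its methods are hypothetical, and `java.util.logging` stands in for the logging framework the AM actually uses (`Level.SEVERE` plays the role of ERROR). The only assumption taken from YARN itself is that an exit status of 0 means success.

      ```java
      import java.util.logging.Level;
      import java.util.logging.Logger;

      // Hypothetical sketch: choose the log level for a completed container
      // from its exit status, so that failures are not buried at INFO.
      public class ContainerCompletionLogging {
          private static final Logger LOG =
              Logger.getLogger(ContainerCompletionLogging.class.getName());

          // Assumption: 0 denotes a successful container exit.
          static final int SUCCESS = 0;

          // SEVERE is java.util.logging's closest equivalent to ERROR.
          static Level levelFor(int exitStatus) {
              return exitStatus == SUCCESS ? Level.INFO : Level.SEVERE;
          }

          // Builds the completion message and logs it at the chosen level.
          static String logCompletion(String containerId, int exitStatus,
                                      String diagnostics) {
              String message = String.format(
                  "[%s] completed. exitStatus=%d, diagnostics=%s",
                  containerId, exitStatus, diagnostics);
              LOG.log(levelFor(exitStatus), message);
              return message;
          }

          public static void main(String[] args) {
              // Values mirror the failing container in the log above.
              logCompletion("container_e05_1528335963594_0001_01_000002", -1,
                  "Exception from container-launch. Exit code: 7");
          }
      }
      ```

      With this shape, the failed launch above (exitStatus=-1) would be reported at the error level, while a clean exit would stay at INFO.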

      Attachments

        1. YARN-8407.001.patch
          2 kB
          Yesha Vora
        2. YARN-8407.002.patch
          2 kB
          Yesha Vora
        3. YARN-8407.003.patch
          2 kB
          Yesha Vora

          People

            Assignee: Yesha Vora (yeshavora)
            Reporter: Yesha Vora (yeshavora)