Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-40635

Scala 2.12 + Hadoop 2 + JDK 8 Daily Test failed

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.4.0
    • 3.4.0
    • Tests, YARN
    • None

    Description

      https://github.com/apache/spark/actions/runs/3164718086

       

      [error]     org.apache.spark.deploy.yarn.YarnShuffleAlternateNameConfigWithLevelDBBackendSuite
      [error]     org.apache.spark.deploy.yarn.YarnShuffleIntegrationWithLevelDBBackendSuite
      [error]     org.apache.spark.deploy.yarn.YarnClusterSuite
      [error]     org.apache.spark.deploy.yarn.YarnShuffleAuthWithLevelDBBackendSuite
      [error]     org.apache.spark.deploy.yarn.YarnShuffleAlternateNameConfigWithRocksDBBackendSuite
      [error]     org.apache.spark.deploy.yarn.YarnShuffleIntegrationWithRocksDBBackendSuite
      [error]     org.apache.spark.deploy.yarn.YarnShuffleAuthWithRocksDBBackendSuite 

      Attachments

        1. build-failed
          3.23 MB
          Yang Jie

        Activity

          gurwls223 Hyukjin Kwon added a comment - - edited

          Issue resolved by pull request 38079
          https://github.com/apache/spark/pull/38079

          gurwls223 Hyukjin Kwon added a comment - - edited Issue resolved by pull request 38079 https://github.com/apache/spark/pull/38079
          apachespark Apache Spark added a comment -

          User 'LuciferYang' has created a pull request for this issue:
          https://github.com/apache/spark/pull/38079

          apachespark Apache Spark added a comment - User 'LuciferYang' has created a pull request for this issue: https://github.com/apache/spark/pull/38079
          LuciferYang Yang Jie added a comment - - edited

          Manual test : build/sbt "yarn/test" -Phadoop-2 -Pyarn and   mvn clean test -pl resource-managers/yarn -Phadoop-2 -Pyarn locally, but they all passed.

          From the characteristics of the failed case, the am start failed because the start command line is too long, maybe due to the classpath contains too many jars(CLASSPATH and SPARK_DIST_CLASSPATH includes all jars in .cache)

          LuciferYang Yang Jie added a comment - - edited Manual test : build/sbt "yarn/test" -Phadoop-2 -Pyarn and   mvn clean test -pl resource-managers/yarn -Phadoop-2 -Pyarn locally, but they all passed. From the characteristics of the failed case, the am start failed because the start command line is too long, maybe due to the classpath contains too many jars(CLASSPATH and SPARK_DIST_CLASSPATH includes all jars in .cache)
          LuciferYang Yang Jie added a comment -
          Exception message: Cannot run program "bash" (in directory "/home/runner/work/spark/spark/resource-managers/yarn/target/org.apache.spark.deploy.yarn.YarnClusterSuite/org.apache.spark.deploy.yarn.YarnClusterSuite-localDir-nm-0_0/usercache/runner/appcache/application_1664721938509_0027/container_1664721938509_0027_02_000001"): error=7, Argument list too long
          22096[info]   Stack trace: java.io.IOException: Cannot run program "bash" (in directory "/home/runner/work/spark/spark/resource-managers/yarn/target/org.apache.spark.deploy.yarn.YarnClusterSuite/org.apache.spark.deploy.yarn.YarnClusterSuite-localDir-nm-0_0/usercache/runner/appcache/application_1664721938509_0027/container_1664721938509_0027_02_000001"): error=7, Argument list too long
          22097[info]   	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1048)
          22098[info]   	at org.apache.hadoop.util.Shell.runCommand(Shell.java:526)
          22099[info]   	at org.apache.hadoop.util.Shell.run(Shell.java:482)
          22100[info]   	at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:776)
          22101[info]   	at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:212)
          22102[info]   	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:302)
          22103[info]   	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:82)
          22104[info]   	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
          22105[info]   	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
          22106[info]   	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
          22107[info]   	at java.lang.Thread.run(Thread.java:750)
          22108[info]   Caused by: java.io.IOException: error=7, Argument list too long
          22109[info]   	at java.lang.UNIXProcess.forkAndExec(Native Method)
          22110[info]   	at java.lang.UNIXProcess.<init>(UNIXProcess.java:247)
          22111[info]   	at java.lang.ProcessImpl.start(ProcessImpl.java:134)
          22112[info]   	at java.lang.ProcessBuilder.start(ProcessBuilder.java:1029)
          22113[info]   	... 10 more
          22114[info]    
          LuciferYang Yang Jie added a comment - Exception message: Cannot run program "bash" (in directory "/home/runner/work/spark/spark/resource-managers/yarn/target/org.apache.spark.deploy.yarn.YarnClusterSuite/org.apache.spark.deploy.yarn.YarnClusterSuite-localDir-nm-0_0/usercache/runner/appcache/application_1664721938509_0027/container_1664721938509_0027_02_000001" ): error=7, Argument list too long 22096[info] Stack trace: java.io.IOException: Cannot run program "bash" (in directory "/home/runner/work/spark/spark/resource-managers/yarn/target/org.apache.spark.deploy.yarn.YarnClusterSuite/org.apache.spark.deploy.yarn.YarnClusterSuite-localDir-nm-0_0/usercache/runner/appcache/application_1664721938509_0027/container_1664721938509_0027_02_000001" ): error=7, Argument list too long 22097[info] at java.lang.ProcessBuilder.start(ProcessBuilder.java:1048) 22098[info] at org.apache.hadoop.util.Shell.runCommand(Shell.java:526) 22099[info] at org.apache.hadoop.util.Shell.run(Shell.java:482) 22100[info] at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:776) 22101[info] at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:212) 22102[info] at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:302) 22103[info] at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:82) 22104[info] at java.util.concurrent.FutureTask.run(FutureTask.java:266) 22105[info] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) 22106[info] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) 22107[info] at java.lang. Thread .run( Thread .java:750) 22108[info] Caused by: java.io.IOException: error=7, Argument list too long 22109[info] at java.lang.UNIXProcess.forkAndExec(Native Method) 22110[info] at java.lang.UNIXProcess.<init>(UNIXProcess.java:247) 22111[info] at java.lang.ProcessImpl.start(ProcessImpl.java:134) 22112[info] at java.lang.ProcessBuilder.start(ProcessBuilder.java:1029) 22113[info] ... 10 more 22114[info]

          People

            LuciferYang Yang Jie
            LuciferYang Yang Jie
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: