Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-25743

New executors are not launched for kubernetes spark thrift on deleting existing executors

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Incomplete
    • 2.2.0
    • None
    • Kubernetes, Spark Core

    Description

      Launched spark thrift in kubernetes cluster with dynamic allocation enabled.

      Configurations set : 

      spark.executor.memory=35g
      spark.executor.cores=8
      spark.dynamicAllocation.enabled=true
      spark.dynamicAllocation.executorIdleTimeout=10
      spark.dynamicAllocation.cachedExecutorIdleTimeout=15
      spark.driver.memory=10g
      spark.driver.cores=4
      spark.sql.crossJoin.enabled=true
      spark.sql.starJoinOptimization=true
      spark.sql.codegen=true
      spark.rpc.numRetries=5
      spark.rpc.retry.wait=5
      spark.sql.broadcastTimeout=1200
      spark.network.timeout=1800
      spark.dynamicAllocation.maxExecutors=15
      spark.kubernetes.allocation.batch.size=2
      spark.kubernetes.allocation.batch.delay=9
      spark.serializer=org.apache.spark.serializer.KryoSerializer
      spark.kubernetes.node.selector.is_control=false

      Tried to run TPCDS queries , on a 1TB parquet snappy data . 

      Found that as the execution progress, the tasks are done by a single executor ( executor 53 ) and no new executors are getting spawned, even though there is enough resources to spawn more executors.

       

      Tried to manually delete the executor pod 53 and saw that no new executor has been spawned to replace the one which is running.

      Attcahed the 

      Attachments

        1. query_0_correct.sql
          159 kB
          neenu
        2. driver.log
          7.99 MB
          neenu

        Activity

          People

            Unassigned Unassigned
            nanndomi neenu
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: