Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-37049

executorIdleTimeout is not working for pending pods on K8s

    XMLWordPrintableJSON

Details

    Description

      SPARK-33099 added the support to respect "spark.dynamicAllocation.executorIdleTimeout" in ExecutorPodsAllocator. However, when it checks if a pending executor pod is timed out, it checks against the pod's "startTime". A pending pod "startTime" is empty, and this causes the function "isExecutorIdleTimedOut()" always return true for pending pods.

      This caused the issue, pending pods are deleted immediately when a stage is finished and several new pods got recreated again in the next stage.

      Attachments

        Activity

          People

            wwei Weiwei Yang
            wwei Weiwei Yang
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: