[SPARK-34519] ExecutorPodsAllocator should use an exponential backoff strategy when an executor pod request fails - ASF JIRA

Details

Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Affects Version/s: 3.0.1
Fix Version/s: None
Component/s: Kubernetes
Labels: None
Environment: Spark 3.0.1, Kubernetes 1.18.8

Description

Create a resource quota: `kubectl create quota test --hard=cpu=20,memory=60G`
Submit an application that requests more than the quota allows (10 executors x 5 cores = 50 cores and 10 x 10G = 100G of executor memory): `spark-submit --executor-cores 5 --executor-memory 10G --num-executors 10 <spark-pi.py>`
The log line `ExecutorPodsAllocator: Going to request 5 executors from Kubernetes.` then appears to be printed every second.

`spark.kubernetes.allocation.batch.delay` defaults to 1s, which is good enough when allocation succeeds, but exponential backoff may be a better choice when allocation fails.
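
To illustrate the proposal, here is a minimal sketch of the backoff behaviour in Python. It is not Spark code (`ExecutorPodsAllocator` is written in Scala), and the class name `AllocationBackoff`, its parameters, and the 64s cap are all assumptions made for illustration. The idea is that the delay starts at the current 1s default, doubles after each failed pod request, and resets as soon as a request succeeds.

```python
class AllocationBackoff:
    """Hypothetical helper: tracks the delay to wait before the next allocation batch."""

    def __init__(self, base_delay_s=1.0, factor=2.0, max_delay_s=64.0):
        # base_delay_s mirrors the 1s default of spark.kubernetes.allocation.batch.delay;
        # factor and max_delay_s are illustrative values, not Spark settings.
        self.base_delay_s = base_delay_s
        self.factor = factor
        self.max_delay_s = max_delay_s
        self.delay_s = base_delay_s

    def record(self, succeeded):
        """Update the delay after a batch request and return the new delay in seconds."""
        if succeeded:
            # A successful request restores the normal batch delay.
            self.delay_s = self.base_delay_s
        else:
            # A failed request (for example, one rejected by a resource quota)
            # multiplies the delay, capped at max_delay_s.
            self.delay_s = min(self.delay_s * self.factor, self.max_delay_s)
        return self.delay_s


if __name__ == "__main__":
    backoff = AllocationBackoff()
    for attempt in range(8):
        print(f"attempt {attempt}: wait {backoff.record(False)}s before retrying")
    print(f"after success: wait {backoff.record(True)}s")
```

With this scheme, a quota that stays exhausted would produce a request roughly every 64s in the steady state instead of every second, while a healthy cluster keeps the existing 1s cadence.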

People

Assignee: Unassigned

Reporter: Fengyu Cao

Votes: 0

Watchers: 2

Dates

Created: 24/Feb/21 10:27

Updated: 24/Feb/21 10:27