Details
- Type: Umbrella
- Status: Resolved
- Priority: Minor
- Resolution: Done
Description
This is an umbrella task to capture several pieces of work related to significantly improving spark-ec2 cluster launch times.
There are several optimizations we know we can make to setup.sh to make cluster launches faster.
There are also some improvements to the AMIs that will help a lot.
Potential improvements:
- Upgrade the Spark AMIs and pre-install tools like Ganglia on them. This will reduce or eliminate SSH wait time and Ganglia init time.
- Replace each "download, then rsync to the rest of the cluster" step with parallel downloads on all nodes of the cluster.
- Replace instances of
    for node in $NODES; do
      command
      sleep 0.3
    done
    wait
  with simpler calls to pssh.
- Remove the linear backoff when waiting for SSH availability, since we now wait for EC2 status checks to clear before testing SSH.
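To illustrate the "run on every node at once" idea behind the parallel-download and pssh items above, here is a minimal Python sketch. It is not taken from spark-ec2 or setup.sh: the function name `run_on_all_nodes`, the `run` hook, and the default `ssh` invocation are all assumptions made for illustration, and the default runner assumes key-based SSH login is already working.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def run_on_all_nodes(nodes, remote_command, run=None, max_workers=32):
    """Run remote_command on every node concurrently.

    Returns a dict mapping each node to the exit code of its command.
    `run` is a hypothetical hook (node, command) -> exit code; it defaults
    to invoking plain `ssh`, which is an assumption of this sketch.
    """
    if run is None:
        def run(node, command):
            # Assumed default: shell out to ssh and return its exit status.
            return subprocess.call(["ssh", node, command])

    if not nodes:
        return {}

    # One worker per node (capped), so no node waits for another to finish.
    workers = min(max_workers, len(nodes))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {node: pool.submit(run, node, remote_command) for node in nodes}
        # result() blocks until that node's command has completed.
        return {node: future.result() for node, future in futures.items()}
```

Unlike the loop with a fixed `sleep 0.3` per node, the start-up cost here does not grow with cluster size, and per-node exit codes are collected so failures can be reported. pssh offers the same behavior from the shell without custom code.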
Issue Links
- contains
  - SPARK-5189 Reorganize EC2 scripts so that nodes can be provisioned independent of Spark master (Resolved)
  - SPARK-3821 Develop an automated way of creating Spark images (AMI, Docker, and others) (Closed)
1. Use EC2 status checks to know when to test SSH availability | Resolved | Nicholas Chammas
2. Use pssh instead of bash-isms and remove unnecessary operations | Resolved | Nicholas Chammas
3. Bake common tools like ganglia into Spark AMI | Resolved | Unassigned