Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
0.9.0
-
None
-
None
Description
EC instances are sometimes slow to start up. When this happens, generating the cluster ssh key or sending the generated cluster key to the slaves can fail due to an ssh timeout.
The script currently hard-codes the number of tries for ssh operations as 2.
For more flexibility, it should be possible to specify the number of tries with a command-line option, --num-ssh-tries, that defaults to 2 to keep the current behavior if not provided.
Attachments
Issue Links
- relates to
-
SPARK-3398 Have spark-ec2 intelligently wait for specific cluster states
- Resolved