Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-4325

Improve spark-ec2 cluster launch times

    XMLWordPrintableJSON

Details

    • Umbrella
    • Status: Resolved
    • Minor
    • Resolution: Done
    • None
    • None
    • EC2
    • None

    Description

      This is an umbrella task to capture several pieces of work related to significantly improving spark-ec2 cluster launch times.

      There are several optimizations we know we can make to setup.sh to make cluster launches faster.

      There are also some improvements to the AMIs that will help a lot.

      Potential improvements:

      • Upgrade the Spark AMIs and pre-install tools like Ganglia on them. This will reduce or eliminate SSH wait time and Ganglia init time.
      • Replace instances of download; rsync to rest of cluster with parallel downloads on all nodes of the cluster.
      • Replace instances of
        for node in $NODES; do
          command
          sleep 0.3
        done
        wait

        with simpler calls to pssh.

      • Remove the linear backoff when we wait for SSH availability now that we are already waiting for EC2 status checks to clear before testing SSH.

      Attachments

        Issue Links

          Activity

            People

              nchammas Nicholas Chammas
              nchammas Nicholas Chammas
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: