Spark / SPARK-3398

Have spark-ec2 intelligently wait for specific cluster states

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.2.0
    • Component/s: EC2
    • Labels: None

    Description

      spark-ec2 currently relies on retry logic in two places: when it tries to install stuff on a cluster and when it tries to destroy security groups.

      It would be better for spark-ec2 to explicitly wait until all the nodes in the cluster it is working on have reached a specific state.

      Examples:

      • Wait for all nodes to be up
      • Wait for all nodes to be up and accepting SSH connections (then start installing stuff)
      • Wait for all nodes to be down
      • Wait for all nodes to be terminated (then delete the security groups)

      A function in the spark_ec2.py script that blocks until the desired cluster state is reached would reduce the need for the various pieces of retry logic. It would probably also eliminate the need for the --wait parameter.
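
A minimal sketch of such a blocking function follows, to make the idea concrete. It is not the implementation that resolved this issue. The name `wait_for_cluster_state`, its parameters, and the `get_states` callback are all assumptions made for illustration; in spark_ec2.py the callback would presumably query EC2 for each instance's state (for example "pending", "running", "stopped", "terminated").

```python
import time


def wait_for_cluster_state(get_states, desired_state, poll_interval=5.0,
                           timeout=600.0, sleep=time.sleep,
                           clock=time.monotonic):
    """Block until every node reports `desired_state`.

    get_states is a hypothetical callback returning the current state of
    each node as an iterable of strings. Returns the number of polls made.
    Raises TimeoutError if the state is not reached within `timeout` seconds.
    """
    start = clock()
    attempts = 0
    while True:
        attempts += 1
        states = list(get_states())
        # Assumption: an empty result means "nothing known yet", not success.
        if states and all(state == desired_state for state in states):
            return attempts
        if clock() - start >= timeout:
            raise TimeoutError(
                "cluster did not reach state %r within %s seconds"
                % (desired_state, timeout)
            )
        sleep(poll_interval)
```

With a helper like this, the script would call it once before installing and once before deleting security groups, rather than retrying each of those operations until it happens to succeed.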

            People

              Assignee: Nicholas Chammas (nchammas)
              Reporter: Nicholas Chammas (nchammas)