Details
- Type: Bug
- Status: Resolved
- Priority: Major
- Resolution: Done
Description
SparkClientImpl#startDriver currently looks for the SPARK_HOME directory and invokes the bin/spark-submit script, which spawns a separate process to run the Spark application.
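For illustration, the process-based approach described above looks roughly like the sketch below. This is not the actual SparkClientImpl code: the class name SparkSubmitSketch, the helper buildCommand, the main class com.example.Driver and the jar path are all hypothetical placeholders.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

public class SparkSubmitSketch {

    // Builds the argument list for <sparkHome>/bin/spark-submit.
    // mainClass and appJar are placeholders supplied by the caller.
    static List<String> buildCommand(String sparkHome, String mainClass, String appJar) {
        List<String> argv = new ArrayList<>();
        argv.add(new File(new File(sparkHome, "bin"), "spark-submit").getPath());
        argv.add("--class");
        argv.add(mainClass);
        argv.add(appJar);
        return argv;
    }

    public static void main(String[] args) throws Exception {
        String sparkHome = System.getenv("SPARK_HOME");
        if (sparkHome == null) {
            System.err.println("SPARK_HOME is not set");
            return;
        }
        // Hypothetical application coordinates, for illustration only.
        List<String> argv = buildCommand(sparkHome, "com.example.Driver", "/path/to/app.jar");

        // Spawn spark-submit as a child process and wait for it to exit.
        Process child = new ProcessBuilder(argv).inheritIO().start();
        System.exit(child.waitFor());
    }
}
```

The caller only sees a java.lang.Process here, so the state of the Spark application has to be inferred from the exit code or the process output.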
SparkLauncher was added in SPARK-4924 and is a programmatic way to launch Spark applications.
I see a few advantages:
- No need to spawn a separate process to launch a HoS (Hive on Spark) application --> lower startup time
- Simplifies the code in SparkClientImpl --> easier to debug
- SparkLauncher#startApplication returns a SparkAppHandle which contains some useful utilities for querying the state of the Spark job
- It also allows the launcher to specify a list of job listeners
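A minimal sketch of what a SparkLauncher-based launch with a listener might look like, assuming the spark-launcher library is on the classpath and SPARK_HOME points at a Spark installation. The class name SparkLauncherSketch, the main class com.example.Driver, the jar path, the local[*] master and the memory setting are hypothetical placeholders, not values taken from Hive.

```java
import java.util.concurrent.CountDownLatch;

import org.apache.spark.launcher.SparkAppHandle;
import org.apache.spark.launcher.SparkLauncher;

public class SparkLauncherSketch {

    public static void main(String[] args) throws Exception {
        String sparkHome = System.getenv("SPARK_HOME");
        if (sparkHome == null) {
            System.err.println("SPARK_HOME is not set");
            return;
        }

        // Released once the application reaches a final state.
        CountDownLatch done = new CountDownLatch(1);

        SparkAppHandle.Listener listener = new SparkAppHandle.Listener() {
            @Override
            public void stateChanged(SparkAppHandle handle) {
                System.out.println("state: " + handle.getState()
                        + ", appId: " + handle.getAppId());
                if (handle.getState().isFinal()) {
                    done.countDown();
                }
            }

            @Override
            public void infoChanged(SparkAppHandle handle) {
                // Called when other application info (such as the app id) changes.
            }
        };

        // All values below are hypothetical placeholders.
        SparkAppHandle handle = new SparkLauncher()
                .setSparkHome(sparkHome)
                .setAppResource("/path/to/app.jar")
                .setMainClass("com.example.Driver")
                .setMaster("local[*]")
                .setConf(SparkLauncher.DRIVER_MEMORY, "1g")
                .startApplication(listener);

        done.await();
        System.out.println("final state: " + handle.getState());
    }
}
```

The returned SparkAppHandle also exposes stop() and kill(), which could replace manual management of the child process.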
Attachments
Issue Links
- is related to HIVE-18533 Add option to use InProcessLauncher to submit spark jobs (Closed)
- links to