Details
-
Improvement
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
1.0.0
-
None
Description
This is a ticket to track progress on redesigning the SparkListener and JobProgressListener API.
There are multiple problems with the current design, including:
0. I'm not sure if the API is usable in Java (there are at least some enums we used in Scala and a bunch of case classes that might complicate things).
1. The whole API is marked as DeveloperApi, because we haven't paid a lot of attention to it yet. Something as important as progress reporting deserves a more stable API.
2. There is no easy way to connect jobs with stages. Similarly, there is no easy way to connect job groups with jobs / stages.
3. JobProgressListener itself has no encapsulation at all. States can be arbitrarily mutated by external programs. Variable names are sort of randomly decided and inconsistent.
We should just revisit these and propose a new, concrete design.
Attachments
Issue Links
- blocks
-
SPARK-4017 Progress bar in console
- Resolved
- is duplicated by
-
SPARK-2633 enhance spark listener API to gather more spark job information
- Resolved
- is related to
-
SPARK-2633 enhance spark listener API to gather more spark job information
- Resolved
-
SPARK-2636 Expose job ID in JobWaiter API
- Resolved
-
SPARK-576 Design and develop a more precise progress estimator
- Resolved
-
SPARK-3059 Spark internal module interface design
- Closed
- relates to
-
SPARK-3644 REST API for Spark application info (jobs / stages / tasks / storage info)
- Resolved
-
SPARK-4440 Enhance the job progress API to expose more information
- Resolved
-
SPARK-4567 Make SparkJobInfo and SparkStageInfo serializable
- Resolved
- links to