Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
2.3.1
-
None
Description
SinkProgress should report similar properties like SourceProgress as long as they are available for given Sink. Count of written rows is metric availble for all Sinks. Since relevant progress information is with respect to commited rows, ideal object to carry this info is WriterCommitMessage. For brevity the implementation will focus only on Sinks with API V2 and on Micro Batch mode. Implemention for Continuous mode will be provided at later date.
Before
{"description":"org.apache.spark.sql.kafka010.KafkaSourceProvider@3c0bd317"}
After
{"description":"org.apache.spark.sql.kafka010.KafkaSourceProvider@3c0bd317","numOutputRows":5000}
Attachments
Issue Links
- is depended upon by
-
SPARK-24647 Sink Should Return Writen Offsets For ProgressReporting
- Resolved
- is related to
-
SPARK-24863 Report offset lag as a custom metrics for Kafka structured streaming source
- Resolved
-
SPARK-21313 ConsoleSink's string representation
- Resolved
- relates to
-
SPARK-24748 Support for reporting custom metrics via Streaming Query Progress
- Resolved
- links to