Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-24933

SinkProgress should report written rows

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.3.1
    • 3.0.0
    • Structured Streaming
    • None

    Description

      SinkProgress should report similar properties like SourceProgress as long as they are available for given Sink. Count of written rows is metric availble for all Sinks. Since relevant progress information is with respect to commited rows, ideal object to carry this info is WriterCommitMessage. For brevity the implementation will focus only on Sinks with API V2 and on Micro Batch mode. Implemention for Continuous mode will be provided at later date.

      Before

      {"description":"org.apache.spark.sql.kafka010.KafkaSourceProvider@3c0bd317"}
      

      After

      {"description":"org.apache.spark.sql.kafka010.KafkaSourceProvider@3c0bd317","numOutputRows":5000}
      

      Attachments

        Issue Links

          Activity

            People

              vackosar Vaclav Kosar
              vackosar Vaclav Kosar
              Votes:
              7 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: