  1. Spark
  2. SPARK-24748 Support for reporting custom metrics via Streaming Query Progress
  3. SPARK-24863

Report offset lag as a custom metrics for Kafka structured streaming source

Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Incomplete
    • Affects Version/s: 2.4.0
    • Fix Version/s: None
    • Component/s: Structured Streaming

    Description

      We can build on top of SPARK-24748 to report offset lag as a custom metric for the Kafka structured streaming source.

      Offset lag is the difference between the latest offsets available in Kafka at the time the metric is reported (just after a micro-batch completes) and the latest offsets Spark has processed. It is 0 (or close to 0) in steady state if Spark keeps up with the rate at which messages are ingested into the Kafka topics.
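
      The lag described here could be computed per topic partition roughly as follows. This is only a minimal sketch in plain Python, not the Spark implementation: the function names (`offset_lag`, `summarize_lag`), the summary keys, and the choice to treat a partition with no processed offset as starting at 0 are all assumptions made for illustration.

```python
from typing import Dict, Tuple

# (topic name, partition id); a hypothetical stand-in for Kafka's TopicPartition
TopicPartition = Tuple[str, int]


def offset_lag(
    latest_available: Dict[TopicPartition, int],
    latest_processed: Dict[TopicPartition, int],
) -> Dict[TopicPartition, int]:
    """Per-partition lag: latest offset in Kafka minus latest offset processed."""
    lag = {}
    for tp, available in latest_available.items():
        # Assumption: a partition not yet processed is treated as offset 0.
        processed = latest_processed.get(tp, 0)
        # Clamp at 0 so a stale "available" snapshot never yields negative lag.
        lag[tp] = max(available - processed, 0)
    return lag


def summarize_lag(lag: Dict[TopicPartition, int]) -> Dict[str, int]:
    """Collapse per-partition lag into totals (key names are illustrative)."""
    values = list(lag.values())
    return {
        "totalOffsetLag": sum(values),
        "maxOffsetLag": max(values, default=0),
    }


if __name__ == "__main__":
    available = {("events", 0): 120, ("events", 1): 75}
    processed = {("events", 0): 100, ("events", 1): 75}
    per_partition = offset_lag(available, processed)
    print(per_partition)                 # {('events', 0): 20, ('events', 1): 0}
    print(summarize_lag(per_partition))  # {'totalOffsetLag': 20, 'maxOffsetLag': 20}
```

      In the example, partition 1 has caught up (lag 0) while partition 0 is 20 offsets behind, which is the kind of per-batch signal the issue proposes to surface.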

            People

              Assignee: Unassigned
              Reporter: Arun Mahadevan (arunmahadevan)