Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-35799

Fix the allUpdatesTimeMs metric measuring in FlatMapGroupsWithStateExec

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 3.1.2
    • 3.2.0, 3.1.3
    • Structured Streaming
    • None

    Description

      Metric allUpdatesTimeMs meant to capture the start to end walltime of the operator FlatMapGroupsWithStateExec, but currently it just captures the iterator creation time. 

      Fix it to measure similar to how other stateful operators measure. Example one here. This measurement is not perfect due to the nature of the lazy iterator and also includes the time the consumer operator spent in processing the current operator output, but it should give a good signal when comparing the metric in one microbatch to the metric in another microbatch.

      Attachments

        Activity

          People

            vkorukanti Venki Korukanti
            vkorukanti Venki Korukanti
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: