SPARK-43183

Move update event on idleness in streaming query listener to separate callback method

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.5.0
    • Fix Version/s: 3.5.0
    • Component/s: Structured Streaming
    • Labels: None

    Description

      People have been confused by the update event on idleness for a long time; it is not only a matter of understanding, it also leads to various complaints. For example, since we report the latest batch ID in the update event on idleness, a listener implementation that blindly performs an upsert keyed on batch ID risks losing metrics.
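
      The metric loss can be reproduced with a small, Spark-free sketch. The `ProgressEvent` class and its field names are hypothetical stand-ins rather than the real event schema, and the sketch assumes the idle update carries empty metrics while reusing the latest batch ID.

```python
# Hypothetical, simplified model of progress events; the field names are
# stand-ins and not the real Spark event schema.
from dataclasses import dataclass


@dataclass
class ProgressEvent:
    batch_id: int
    num_input_rows: int


def upsert_by_batch_id(store: dict, event: ProgressEvent) -> None:
    # Blind upsert: whatever arrives last for a batch ID wins.
    store[event.batch_id] = event.num_input_rows


store = {}
events = [
    ProgressEvent(batch_id=4, num_input_rows=80),
    ProgressEvent(batch_id=5, num_input_rows=100),
    # Idle update (assumed shape): no new batch ran, so the latest batch ID (5)
    # is reused while the metrics describe "nothing happened".
    ProgressEvent(batch_id=5, num_input_rows=0),
]
for e in events:
    upsert_by_batch_id(store, e)

# The 100 rows recorded for batch 5 have been overwritten by the idle update.
total_rows = sum(store.values())
```

      After the loop, the entry for batch 5 holds the idle update's value, so the rows processed in that batch no longer appear in the total.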

      This also complicates the logic because we have to remember the execution of the previous batch, which is arguably unnecessary.

      Because of this, we should move the idle event out of the progress update event and provide a separate callback method for it.
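
      A rough sketch of how a listener might look once idleness has its own callback. The class, the method names `on_query_progress` and `on_query_idle`, and the `dispatch` helper are illustrative assumptions for this sketch, not the API introduced by the fix.

```python
# Hypothetical listener interface; class and method names are illustrative only.
class MetricsListener:
    def __init__(self):
        self.rows_by_batch = {}
        self.idle_count = 0

    def on_query_progress(self, batch_id: int, num_input_rows: int) -> None:
        # Only real batches arrive here, so an upsert keyed on batch ID is safe.
        self.rows_by_batch[batch_id] = num_input_rows

    def on_query_idle(self) -> None:
        # Idle notifications carry no batch metrics to store.
        self.idle_count += 1


def dispatch(listener, events):
    # Each event is ("progress", batch_id, rows) or ("idle",).
    for event in events:
        if event[0] == "progress":
            listener.on_query_progress(event[1], event[2])
        else:
            listener.on_query_idle()


listener = MetricsListener()
dispatch(listener, [("progress", 4, 80), ("progress", 5, 100), ("idle",)])
```

      With idleness routed to its own method, the progress callback never sees a repeated batch ID, so the per-batch metrics stay intact.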

          People

            Assignee: Jungtaek Lim (kabhwan)
            Reporter: Jungtaek Lim (kabhwan)