Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-41145

Assert the offset range for file stream source in Trigger.AvailableNow

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Not A Problem
    • 3.4.0
    • None
    • Structured Streaming
    • None

    Description

      We encountered the issue where the data source did not properly implement the offset with Trigger.AvailableNow, and the query ran with processing same data continuously without stopping.

      We would like to proactively avoid such case for most used data sources. I'll create a new JIRA ticket for Kafka data source as well.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              kabhwan Jungtaek Lim
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: