Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-40826

Add additional checkpoint rename file check

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.4.0
    • 3.4.0
    • Structured Streaming
    • None

    Description

      We encountered an issue recently that one customer's structured streaming job failed to read delta file.

      The temporary file exists but it was not successfully renamed to final delta file path.

      We currently don't check if renamed file exists but assume it successful. As the result, failing to read delta file assumed to be committed in last batch makes re-triggering the job impossible.

      We should be able to do a check against checkpoint renamed file to prevent such difficulty in advance.

      Attachments

        Activity

          People

            viirya L. C. Hsieh
            viirya L. C. Hsieh
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: