Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-48589

Add option snapshotStartBatchId and snapshotPartitionId to state data source

    XMLWordPrintableJSON

Details

    Description

      Define two new options, snapshotStartBatchId and snapshotPartitionId, for the existing state reader. Both of them should be provided at the same time.

      1. When there is no snapshot file at that batch (note there is an off-by-one issue between version and batch Id), throw an exception.
      2. Otherwise, the reader should continue to rebuild the state by reading delta files only, and ignore all snapshot files afterwards.
      3. Note that if a batchId option is already specified. That batchId is the ending batchId, we should then end at that batchId.

      Attachments

        Issue Links

          Activity

            People

              eason-yuchen-liu Yuchen Liu
              eason-yuchen-liu Yuchen Liu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: