Spark / SPARK-39593

Configurable State Checkpointing Frequency in Structured Streaming

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.3.0
    • Fix Version/s: None
    • Component/s: Structured Streaming
    • Labels: None

    Description

      Currently, stateful pipelines checkpoint their state changes on every micro-batch. State checkpoints can contribute significantly to the latency of a micro-batch. If state is checkpointed less frequently, the cost of each checkpoint is amortized over several micro-batches, which reduces its effect on average batch latency. This can be combined with asynchronous state checkpointing to further reduce the latency that state checkpointing may incur.
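
      The amortization argument can be illustrated with a small back-of-the-envelope model. This is only a sketch, not Spark code: the function name, its parameters, and the latency figures are hypothetical, and it assumes that every micro-batch has the same processing time and that a checkpoint adds a fixed cost to the batch in which it runs.

```python
def average_batch_latency(processing_ms, checkpoint_ms, checkpoint_interval, num_batches):
    """Mean latency per micro-batch when state is checkpointed
    once every `checkpoint_interval` batches (hypothetical model)."""
    if checkpoint_interval < 1:
        raise ValueError("checkpoint_interval must be >= 1")

    total_ms = 0.0
    for batch_id in range(1, num_batches + 1):
        # Every batch pays the processing cost.
        total_ms += processing_ms
        # Only every `checkpoint_interval`-th batch pays the checkpoint cost.
        if batch_id % checkpoint_interval == 0:
            total_ms += checkpoint_ms
    return total_ms / num_batches


# Illustrative numbers only: 100 ms of processing, 50 ms per checkpoint.
every_batch = average_batch_latency(100, 50, checkpoint_interval=1, num_batches=100)
every_tenth = average_batch_latency(100, 50, checkpoint_interval=10, num_batches=100)
print(every_batch, every_tenth)
```

      Under these assumptions the average checkpoint overhead per batch falls from the full checkpoint cost to roughly that cost divided by the interval. The batches that do checkpoint still pay the full cost, so the model describes average latency rather than worst-case latency.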

            People

              Assignee: Unassigned
              Reporter: Boyang Jerry Peng (jerrypeng)
              Votes: 0
              Watchers: 3