Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-12634

Should checkpoint after restore finished

    XMLWordPrintableJSON

Details

    Description

      For state stores, Kafka Streams maintains local checkpoint files to track the offsets of the state store changelog topics. The checkpoint is updated on commit or when a task is closed cleanly.

      However, after a successful restore, the checkpoint is not written. Thus, if an instance crashes after restore but before committing, even if the state is on local disk the checkpoint file is missing (indicating that there is no state) and thus state would be restored from scratch.

      While for most cases, the time between restore end and next commit is small, there are cases when this time could be large, for example if there is no new input data to be processed (if there is no input data, the commit would be skipped).

      Thus, we should write the checkpoint file after a successful restore to close this gap (or course, only for at-least-once processing).

      Attachments

        Issue Links

          Activity

            People

              pnee Philip Nee
              mjsax Matthias J. Sax
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: