Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-27132

CheckpointResourcesCleanupRunner might discard shared state of the initial checkpoint

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.15.0, 1.16.0
    • None
    • None

    Description

      When considering the following case: # A job starts from a checkpoint in NO_CLAIM mode, with incremental checkpoints enabled

      1. It produces some new checkpoints and subsumes the original one (not discarding shared state - before FLINK-24611 or after FLINK-26985)
      2. Job terminates abruptly
      3. The cleaner is started for that job
      4. ZK doesn't have the initial checkpoint, so the store will load only the new checkpoints (created in 2). Shared state is registered
      5. The store is shut down - discarding all the checkpoints and also any shared state

      In 6 5, if some checkpoint uses the initial state, it will also be discarded
       
      mapohl could you please confirm this?
       
      cc: yunta

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              roman Roman Khachatryan
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated: