Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-29109

Checkpoint path conflict with stateless upgrade mode

    XMLWordPrintableJSON

Details

    Description

      A stateful job with stateless upgrade mode (yes, there are such use cases) fails with checkpoint path conflict due to constant jobId and FLINK-19358 (applies to Flink < 1.16x). Since with stateless upgrade mode the checkpoint id resets on restart the job is going to write to previously used locations and fail. The workaround is to rotate the jobId on every redeploy when the upgrade mode is stateless. While this can be worked around externally it is best done in the operator itself because reconciliation resolves when a restart is actually required while rotating jobId externally may trigger unnecessary restarts.

      Attachments

        Activity

          People

            thw Thomas Weise
            thw Thomas Weise
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: