Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-20222

The CheckpointCoordinator should reset the OperatorCoordinators when fail before the first checkpoint.

    XMLWordPrintableJSON

    Details

      Description

      Right now, if a job failed before the first successful checkpoint, the CheckpointCoordinator will not reset the OperatorCoordinator state. This may leave the OperatorCoordinators in inconsistent state.

      The CheckpointCoordinator should also reset the OperatorCoordinator state in this case, just like it does for the master hooks. It essentially means "reset to no checkpoint". There are two options for the fix:

      1. Add a reset() method to the OperatorCoordinator.
      2. Call resetToCheckpoint(null) on the OperatorCoordinator.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                sewen Stephan Ewen
                Reporter:
                becket_qin Jiangjie Qin
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: