Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Description
The savepoint coordinator requires that the number of collected states of an operator exactly match the parallelism of the new job (the one being restored). This is too strict.
For example, a Kafka source with parallelism 2 reading a single Kafka partition collects no state for one of the two source subtasks (hence that subtask is not part of the savepoint state). Currently, restoring the same job with the same parallelism fails, which should not happen.
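To make the mismatch concrete, here is a minimal sketch of the two checks in plain Python. It is not Flink code: `check_strict`, `check_relaxed`, and the dictionary of per-subtask state are hypothetical names used only for illustration, and the relaxed rule is just one plausible way to loosen the check, not necessarily what the actual fix does.

```python
from typing import Dict


def check_strict(collected: Dict[int, bytes], parallelism: int) -> bool:
    """Strict rule from the description: state count must equal parallelism."""
    return len(collected) == parallelism


def check_relaxed(collected: Dict[int, bytes], parallelism: int) -> bool:
    """Hypothetical relaxed rule (an assumption, not the confirmed fix):
    subtasks without state are allowed, as long as every collected state
    belongs to a subtask index that exists in the restored job."""
    return all(0 <= index < parallelism for index in collected)


# Kafka example: parallelism 2, one partition, so only subtask 0 has state.
savepoint_state = {0: b"partition-0-offset"}

print(check_strict(savepoint_state, 2))   # False: 1 state != parallelism 2
print(check_relaxed(savepoint_state, 2))  # True: subtask 1 simply has no state
```

Under the strict rule the restore is rejected even though nothing about the job changed; under the relaxed rule the subtask without state would start empty.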