Details
-
Bug
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
1.12.0, 1.12.1
Description
Fix of FLINK-20433 shows potential corruption after recovery for all variations of UnalignedCheckpointITCase.
To reproduce, run UCITCase a couple hundreds times. The issue showed for me in:
- execute [Parallel union, p = 5]
- execute [Parallel union, p = 10]
- execute [Parallel cogroup, p = 5]
- execute [parallel pipeline with remote channels, p = 5]
with decreasing frequency.
The issue manifests as one of the following issues:
- stream corrupted exception
- EOF exception
- assertion failure in NUM_LOST or NUM_OUT_OF_ORDER
- (for union) ArithmeticException overflow (because the number that should be [0;100000] has been mis-deserialized)
Attachments
Issue Links
- causes
-
FLINK-21104 UnalignedCheckpointITCase.execute failed with "IllegalStateException"
- Closed
- is duplicated by
-
FLINK-20309 UnalignedCheckpointTestBase.execute is failed
- Closed
-
FLINK-20662 UnalignedCheckpointITCase.execute failed with IndexOutOfBoundsException
- Closed
-
FLINK-20744 org.apache.flink.test.checkpointing.UnalignedCheckpointITCase fails due to java.lang.ArrayIndexOutOfBoundsException
- Closed
- relates to
-
FLINK-20960 Add warning in 1.12 release notes about potential corrupt data stream with unaligned checkpoint
- Closed
- links to