Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-19300

Timer loss after restoring from savepoint

    XMLWordPrintableJSON

    Details

      Description

      While using heap-based timers, we are seeing occasional timer loss after restoring program from savepoint, especially when using a remote savepoint storage (s3). 

      After some investigation, the issue seems to be related to this line in deserialization. When trying to check the VERSIONED_IDENTIFIER, the input stream may not guarantee filling the byte array, causing timers to be dropped for the affected key group.

      Should keep reading until expected number of bytes are actually read or if end of the stream has been reached. 

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                xianggao Xiang Gao
                Reporter:
                xianggao Xiang Gao
              • Votes:
                1 Vote for this issue
                Watchers:
                10 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: