Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-8042

Retry individual failover-strategy for some time first before reverting to full job restart

    XMLWordPrintableJSON

Details

    Description

      Let's say we lost a taskmanager node. When Flink tries to attempt fine grained recovery and fails replacement taskmanager node didn't come back in time, it reverts to full job restart.

      Stephan and Till was suggesting that Flink can/should retry fine grained recovery for some time before giving up and reverting full job restart

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              stevenz3wu Steven Zhen Wu
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: