Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-33735 FLIP-364: Improve the exponential-delay restart-strategy
  3. FLINK-32895

Introduce the max attempts for Exponential Delay Restart Strategy

    XMLWordPrintableJSON

Details

    Description

      Currently, Flink has 3 restart strategies, they are: fixed-delay, failure-rate and exponential-delay.

      The exponential-delay is suitable if a job continues to fail for a period of time. The fixed-delay and failure-rate has the max attempts mechanism, that means, the job won't restart and go to fail after the attempt exceeds the threshold of max attempts.

      The max attempts mechanism is reasonable, flink should not or need to infinitely restart the job if the job keeps failing. However, the exponential-delay doesn't have the max attempts mechanism.

      I propose introducing the `restart-strategy.exponential-delay.max-attempts-before-reset` to support the max attempts mechanism for exponential-delay. It means flink won't restart job if the number of job failures before reset exceeds max-attempts-before-reset when is exponential-delay is enabled.

      Attachments

        Issue Links

          Activity

            People

              fanrui Rui Fan
              fanrui Rui Fan
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: