Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-24384

Count checkpoints failed in trigger phase into numberOfFailedCheckpoints

    XMLWordPrintableJSON

Details

    Description

      Problem

      In current implementation, checkpoints failed in trigger phase do not count into metric 'numberOfFailedCheckpoints'. Such that users can not aware checkpoint stoped by this metric.

      As lang as users can use rules like 'numberOfCompletedCheckpoints' not increase in some minutes past (maybe checkpoint interval + timeout) for alerting, but I think it is ambages and can not alert timely.

       

      Proposal

      As the title, count checkpoints failed in trigger phase into 'numberOfFailedCheckpoints'.

      Attachments

        Issue Links

          Activity

            People

              Feifan Wang Feifan Wang
              Feifan Wang Feifan Wang
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: