Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-27187

The attemptsPerUpload metric may be lower than it actually is

    XMLWordPrintableJSON

Details

    Description

      The attemptsPerUpload metric in ChangelogStorageMetricGroup indicate distributions of number of attempts per upload.

      In the current implementation, each successful attempt try to update attemptsPerUpload with its attemptNumber.

      But consider this case: 

      1. attempt 1 timeout, then schedule attempt 2
      2. attempt 1 completed before attempt 2 and update attemptsPerUpload with 1

      In fact there are two attempts, but attemptsPerUpload updated with 1.

      So, I think we should add "actionAttemptsCount" to RetryExecutor.RetriableActionAttempt, this field shared across all attempts to execute the same upload action representing the number of upload attempts. And completed attempt should use this field update attemptsPerUpload.

       

      How do you think about ? ym , roman 

      Attachments

        Issue Links

          Activity

            People

              Feifan Wang Feifan Wang
              Feifan Wang Feifan Wang
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: