Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-13370

Offset commit failure percentage metric is not computed correctly (regression)

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Blocker
    • Resolution: Fixed
    • 2.8.0
    • 3.1.0, 3.0.1, 2.8.2
    • connect, metrics
    • None
    • Confluent Platform Helm Chart (v6.2.0)

    Description

      There seems to have been a regression in the way the offset-commit-* metrics are calculated for source Kafka Connect connectors since version 2.8.0.

      Before this version, any timeout or interruption while trying to commit offsets for source connectors (e.g. MM2 MirrorSourceConnector) would get correctly flagged as an offset commit failure (i.e the offset-commit-failure-percentage metric ** would be non-zero). Since version 2.8.0, these errors are considered as successes.

      After digging through the code, the commit where this bug was introduced appears to be this one : https://github.com/apache/kafka/commit/047ad654da7903f3903760b0e6a6a58648ca7715

      I believe removing the boolean success argument in the recordCommit method of the WorkerTask class (argument deemed redundant because of the presence of the Throwable error argument) and only considering the presence of a non-null error to determine if a commit is a success or failure might be a mistake. This is because in the commitOffsets method of the WorkerSourceTask class, there are multiple cases where an exception object is either not available or is not passed to the recordCommitFailure method, e.g. :

       

      Attachments

        Activity

          People

            showuon Luke Chen
            20100g Vincent Giroux
            Randall Hauch Randall Hauch
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: