Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-7300

Add KafkaConsumer fetch-error-rate and fetch-error-total metrics

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • clients, consumer, metrics
    • None

    Description

      https://cwiki.apache.org/confluence/display/KAFKA/KIP-356%3A+Add+KafkaConsumer+fetch-error-rate+and+fetch-error-total+metrics

       

      The KafkaConsumer is a complex client that requires many different components to function properly. When a consumer is not operating properly, it can be difficult to identify the root cause and which component is causing issues (ConsumerCoordinator, Fetcher, ConsumerNetworkClient, etc).

       

      This aims to improve the monitoring and detection of KafkaConsumer’s Fetcher component.

       

      Fetcher will send a fetch request for each node that the consumer has assigned partitions for.

       

      This fetch request may fail under the following cases:

      • Intermittent network issues (goes to onFailure)
      • Node sent an invalid full/incremental fetch response (FetchSessionHandler’s handleResponse returns false)
      • FetchSessionIdNotFound
      • InvalidFetchSessionEpochException

       

      These cases are logged, but it would be valuable to provide a corresponding metric that allows for monitoring and alerting.

      Attachments

        Activity

          People

            lu.kevin Kevin Lu
            lu.kevin Kevin Lu
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: