Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-2202

ConsumerPerformance reports a throughput much higher than the actual one

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 0.8.2.0
    • 0.9.0.0
    • tools
    • None

    Description

      I've been using the kafka.tools.ConsumerPerformance tool for some benchmarking until in one of my tests I got a throughput much higher than the supported by my network interface.
      The test consisted in consuming around ~4900 MB from one topic using one consumer with one thread. The reported throughput reported was ~1400 MB/s which surpasses the 10 Gbps of the network. The time for the whole operation was ~8 seconds, which should correspond to a throughput of ~612 MB/s.
      Digging the ConsumerPerformance code, I've found this at line 73:

      val elapsedSecs = (endMs - startMs - config.consumerConfig.consumerTimeoutMs) / 1000.0
      

      The consumerTimeoutMs defined as 5000 at line 131 is always considered leading to wrong results.

      This bug seems to be related to this one https://issues.apache.org/jira/browse/KAFKA-1828

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            omkreddy Manikumar
            capitao Micael Capitão
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment