Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-3556

Improve group coordinator metrics

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      We currently don't have many metrics to track the behavior of the group coordinator (especially with respect to the new consumer). On a quick pass, I only saw a couple gauges in GroupMetadataManager for the number of groups and the number of cached offsets. Here are some interesting metrics that may be worth tracking:

      1. Session timeout rate
      2. Rebalance latency/rate
      3. Commit latency/rate
      4. Average group size
      5. Size of metadata cache

      Some of these may also be interesting to track per group.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            liquanpei Liquan Pei
            hachikuji Jason Gustafson

            Dates

              Created:
              Updated:

              Slack

                Issue deployment