Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
We currently don't have many metrics to track the behavior of the group coordinator (especially with respect to the new consumer). On a quick pass, I only saw a couple gauges in GroupMetadataManager for the number of groups and the number of cached offsets. Here are some interesting metrics that may be worth tracking:
1. Session timeout rate
2. Rebalance latency/rate
3. Commit latency/rate
4. Average group size
5. Size of metadata cache
Some of these may also be interesting to track per group.