Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Currently Ozone has some metrics exposed to jmx about the state of the cluster, but it could be improved. Our goal is to provide more information, e.g.
- number of FSO/OBS buckets
- more information about the storage size (like total, EC, RATIS storage size)
- volume/disk failures detected
This list contains the first thoughts, other ideas are welcomed. These need to be persisted, so in case the cluster is restarted it won't be lost. in HDDS-816 OmMetricsInfo was created, which can be used for this.
Also there are OM metrics that are never populated (like S3 related ones, but there can be others as well). We need to go through the existing metrics and rethink them, if needed fix the usage of them.