Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
2.2.0
-
None
Description
Spark's metrics system prefixes all metrics collected from executors with the executor ID.
This behavior causes two problems:
- it's not possible to aggregate over executors (since the metric name is different for each host)
- upstream metrics systems like Ganglia or Prometheus are put under high load because of the number of time series to store.
By removing the `executorId` from the name of the metric we register, that solves both the above problems