[SPARK-22547] Don't include executor ID in metrics name - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: 2.2.0
Fix Version/s: None
Component/s: Spark Core
Labels:
- bulk-closed

Description

Spark's metrics system prefixes all metrics collected from executors with the executor ID.

https://github.com/apache/spark/blob/fccb337f9d1e44a83cfcc00ce33eae1fad367695/core/src/main/scala/org/apache/spark/metrics/MetricsSystem.scala#L136

This behavior causes two problems:

it's not possible to aggregate over executors (since the metric name is different for each host)
upstream metrics systems like Ganglia or Prometheus are put under high load because of the number of time series to store.

By removing the `executorId` from the name of the metric we register, that solves both the above problems

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Li Haoyi

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 17/Nov/17 18:35

Updated:: 21/May/19 04:16

Resolved:: 21/May/19 04:16