[FLINK-11742] Push metrics to Pushgateway without "instance" - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: In Progress
Priority: Not a Priority
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: Runtime / Metrics
Labels:

Description

According to the official article,

https://prometheus.io/docs/concepts/jobs_instances/

https://github.com/prometheus/pushgateway

when sending a metric to Prometheus Pushgateway, you need to give an "instance" message.
In actual use, after there is no "instance", Prometheus stores metrics with problems, metrics are not continuous, and a lot of data is lost. After adding instance, it returns to normal.

no "instance"

with "instance"

In Prometheus terms, an endpoint you can scrape is called an instance, usually corresponding to a single process. A collection of instances with the same purpose, a process replicated for scalability or reliability for example, is called a job.

For example, an API server job with four replicated instances:
job: api-server
– instance 1: 1.2.3.4:5670
– instance 2: 1.2.3.4:5671
– instance 3: 5.6.7.8:5670
– instance 4: 5.6.7.8:5671

https://prometheus.io/docs/concepts/jobs_instances/#jobs-and-instances

I think a Flink job corresponds to a Prometheus job, and taskmanager and jobmanager correspond to different instances. If the jobName is used as the instance label, the same metrics of different tasksmanages will conflict, and operations such as sum will fail.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

image-2019-02-25-17-16-28-618.png
25/Feb/19 09:16
92 kB
Tom Goong
image-2019-02-25-17-16-59-034.png
25/Feb/19 09:17
72 kB
Tom Goong

Issue Links

links to

GitHub Pull Request #7820

Activity

People

Assignee:: Unassigned

Reporter:: Tom Goong

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 25/Feb/19 09:17

Updated:: 28/Nov/21 22:38

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

10m