Details
-
Improvement
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
1.5.3, 1.6.0, 1.7.0
-
None
-
None
Description
This is an umbrella issue to track the effort to harden Flink's MetricQueryService and to separate it from the rest of the system.
The idea is to setup the MetricQueryService and the metric system in general in such a way that it cannot interfere with or even bring the main Flink components down. Moreover, the metric system also should not degrade performance by simply using any free CPU cycles but not more. Ideally, the user does not see a difference between running Flink with metric query service turned on or off.
Attachments
Issue Links
- is related to
-
FLINK-10226 Latency metrics can choke job-manager
- Closed
1.
|
Run MetricQueryService in separate thread pool | Resolved | Shimin Yang | |
2.
|
Handle oversized metric messages | Closed | vinoyang | |
3.
|
Run MetricQueryService with lower priority | Closed | vinoyang | |
4.
|
Run MetricFetcher in metrics ActorSystem | Resolved | Till Rohrmann |