[BEAM-9030] Bump grpc to 1.26.0 - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: P2
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.20.0
Component/s: java-fn-execution, runner-flink
Labels:
None

Description

When submitting a Python word count job to a Flink session/standalone cluster repeatedly, the meta space usage of the task manager of the Flink cluster will continuously increase (about 40MB each time). The reason is that the Beam classes are loaded with the user class loader in Flink and there are problems with the implementation of `ProcessManager`(from Beam) and `ThreadPoolCache`(from netty) which may cause the user class loader could not be garbage collected even after the job finished which causes the meta space memory leak eventually. You can refer to ~~FLINK-15338~~[1] for more information.

Regarding to `ProcessManager`, I have created a JIRA ~~BEAM-9006~~[2] to track it. Regarding to `ThreadPoolCache`, it is a Netty problem and has been fixed in NETTY#8955[3]. Netty 4.1.35 Final has already included this fix and GRPC 1.22.0 has already dependents on Netty 4.1.35 Final. So we need to bump the version of GRPC to 1.22.0+ (currently 1.21.0).

What do you think?

[1] https://issues.apache.org/jira/browse/FLINK-15338
[2] https://issues.apache.org/jira/browse/BEAM-9006
[3] https://github.com/netty/netty/pull/8955

Attachments

Issue Links

relates to

BEAM-9252 Problem shading Beam pipeline with Beam 2.20.0-SNAPSHOT

Resolved

BEAM-9006 Meta space memory leak caused by the shutdown hook of ProcessManager

Resolved

links to

GitHub Pull Request #10463

GitHub Pull Request #10578

GitHub Pull Request #10602

Activity

People

Assignee:: sunjincheng

Reporter:: sunjincheng

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 24/Dec/19 05:44

Updated:: 09/Oct/20 17:28

Resolved:: 16/Jan/20 00:36

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

8h 50m