[FLINK-14118] Reduce the unnecessary flushing when there is no data available for flush - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Critical
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.10.0
Component/s: Runtime / Network
Labels:
- pull-request-available

Description

The new flush implementation which works by triggering a netty user event may cause performance regression compared to the old synchronization-based one. More specifically, when there is exactly one BufferConsumer in the buffer queue of subpartition and no new data will be added for a while in the future (may because of just no input or the logic of the operator is to collect some data for processing and will not emit records immediately), that is, there is no data to send, the OutputFlusher will continuously notify data available and wake up the netty thread, though no data will be returned by the pollBuffer method.

For some of our production jobs, this will incur 20% to 40% CPU overhead compared to the old implementation. We tried to fix the problem by checking if there is new data available when flushing, if there is no new data, the netty thread will not be notified. It works for our jobs and the cpu usage falls to previous level.

Attachments

Issue Links

links to

GitHub Pull Request #9706

GitHub Pull Request #9850

Activity

People

Assignee:: Yingjie Cao

Reporter:: Yingjie Cao

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 18/Sep/19 08:30

Updated:: 10/Oct/19 10:37

Resolved:: 10/Oct/19 08:12

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

40m