[FLINK-14228] The runtime support for Bounded[One|Multi]Input#endInput does not properly implement their semantics - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 1.9.0
Fix Version/s: 1.11.0
Component/s: Runtime / Task
Labels:
None

Description

Currently, the runtime support implementation of Bounded[One|Multi]Input#endInput has the following problems:

The runtime are propagating endInput immediately on the operator chain when input of the head operator is finished. Because some operators flush the buffered data in close, the downstream operators still receive records after executing endInput. This need the operators to flush the buffered data in endInput instead of close, like the PRs for fixing issue#13491 and issue#13376.

Timers are not taken into account.

Actually, StreamOperator#close tells the operator to finish all its processing and flush output (all remaining buffered data), while endInput indicates that no more data will arrive on some input of the operator. That is to say, for the non-tail operators on the operator chain, when the upstream operator is closed, the input of its downstream operator arrives at the end. So for an operator chain {{OP1 -> OP2 -> ... }}, the logic should be:

(Source/Network)Input of OP1 is finished.
call OP1#endInput
quiesce ProcessingTimeService to disallow OP1 from registering new timers.
wait for the pending timers (in processing) of OP1 to finish.
call OP1#close
call OP2#endInput
quiesce ProcessingTimeService to disallow OP2 from registering new timers.
...

Attachments

Issue Links

relates to

FLINK-18647 How to handle processing time timers with bounded input

Open

Sub-Tasks

There are no Sub-Tasks for this issue.

Activity

People

Assignee:: Haibo Sun

Reporter:: Haibo Sun

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 26/Sep/19 11:31

Updated:: 20/Jul/20 15:33

Resolved:: 21/Feb/20 11:28

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

40m

Include sub-tasks