[FLINK-21915] Optimize Execution#finishPartitionsAndUpdateConsumers - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Closed
Priority: Major
Resolution: Later
Affects Version/s: 1.13.0
Fix Version/s: None
Component/s: Runtime / Coordination
Labels:
- pull-request-available

Description

Based on the scheduler benchmark PartitionReleaseInBatchJobBenchmark introduced in ~~FLINK-20612~~, we find that there's another procedure that has O(N^2) computation complexity: Execution#finishPartitionsAndUpdateConsumers.

Once an execution is finished, it will finish all its BLOCKING partitions and update the partition info to all consumer vertices. The procedure can be illustrated as the following pseudo code:

for all Execution in ExecutionGraph:
  for all produced IntermediateResultPartition of the Execution:
    for all consumer ExecutionVertex of the IntermediateResultPartition:
      update or cache partition info

This procedure has O(N^2) complexity in total.

Based on ~~FLINK-21326~~, the consumed partitions are grouped if they are connected to the same consumer vertices. Therefore, we can update partition info of the entire ConsumedPartitionGroup in batch, rather than one by one. This will decrease the complexity from O(N^2) to O(N).

Attachments

Issue Links

links to

GitHub Pull Request #15382

Activity

People

Assignee:: Unassigned

Reporter:: Zhilong Hong

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 22/Mar/21 11:35

Updated:: 28/Aug/21 13:07

Resolved:: 03/Aug/21 07:19