Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
1.12.0
Description
In my recent TPCDS benchmark, pipelined region scheduling is slower than lazy-from-sources scheduling.
The regression is due to some suboptimal implementation of PipelinedRegionSchedulingStrategy, including:
1. topologically sorting of vertices to deploy
2. unnecessary O(V) loop when sorting an empty set of regions
After improving these implementations, pipelined region scheduling turned to be 10% faster in the previous benchmark setup.
Attachments
Issue Links
- links to