Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Done
-
None
-
None
Description
The current cost-based codegen optimizer has a couple of short-comings that lead to missed opportunities for large-scale computation:
- Cost model unaware of distributed operations, which sometimes leads to unnecessary fusion of cell-wise vector operations into distributed row operations, which hurts performance due to additional side inputs that needs to be broadcast to all executors.
- The current identification of interesting points ignores common Cell entries. In order to reason about above problematic, we need to consider cell to row switches, i.e., Cell templates that are merged into row templates as well.
Attachments
Issue Links
- duplicates
-
SYSTEMDS-1533 Extend cost model for distributed operations and broadcasts
- Closed