Description
This JIRA is a sub-task of SPARK-17626.
The objective is to provide a consistent performance improvement for star schema queries. Star schema consists of one or more fact tables referencing a number of dimension tables. In general, queries against star schema are expected to run fast because of the established RI constraints among the tables. This design proposes a join reordering based on natural, generally accepted heuristics for star schema queries:
- Finds the star join with the largest fact table and places it on the driving arm of the left-deep join. This plan avoids large tables on the inner, and thus favors hash joins.
- Applies the most selective dimensions early in the plan to reduce the amount of data flow.
The design description is included in the below attached document.
Attachments
Attachments
Issue Links
- is related to
-
SPARK-16026 Cost-based Optimizer Framework
- Resolved
- links to