Details
- Improvement
- Status: Resolved
- Critical
- Resolution: Fixed
- Impala 2.9.0
- None
Description
We currently default to a broadcast join when no stats are available: both cost estimates are MAX_LONG, and when the costs are equal, broadcast wins. We should consider making partitioned join the default in that case because it will use less memory.
The code is here: https://github.com/apache/incubator-impala/blob/master/fe/src/main/java/org/apache/impala/planner/DistributedPlanner.java#L509
&& (node.getDistributionModeHint() == DistributionMode.BROADCAST || (node.getDistributionModeHint() != DistributionMode.PARTITIONED && broadcastCost <= partitionCost)))
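To make the tie-breaking concrete, here is a minimal, self-contained sketch of the comparison in the fragment above. It is not the planner's actual code: the class `JoinModeSketch`, the methods `currentChoice` and `proposedChoice`, the `NONE` enum value, and the use of `Long.MAX_VALUE` as the "no stats" cost are all assumptions made for illustration, and the part of the real condition that precedes the leading `&&` is left out. The second method shows one possible way to let partitioned win on a tie (strict `<` instead of `<=`); it is not necessarily the fix that was committed.

```java
// Illustrative sketch only; names here are hypothetical, not Impala's API.
final class JoinModeSketch {
    enum DistributionMode { NONE, BROADCAST, PARTITIONED }

    // Mirrors the quoted fragment: with no hint, `<=` sends ties to broadcast.
    static DistributionMode currentChoice(
            DistributionMode hint, long broadcastCost, long partitionCost) {
        boolean broadcast = hint == DistributionMode.BROADCAST
                || (hint != DistributionMode.PARTITIONED
                        && broadcastCost <= partitionCost);
        return broadcast ? DistributionMode.BROADCAST : DistributionMode.PARTITIONED;
    }

    // One possible alternative: strict `<` sends ties to partitioned.
    static DistributionMode proposedChoice(
            DistributionMode hint, long broadcastCost, long partitionCost) {
        boolean broadcast = hint == DistributionMode.BROADCAST
                || (hint != DistributionMode.PARTITIONED
                        && broadcastCost < partitionCost);
        return broadcast ? DistributionMode.BROADCAST : DistributionMode.PARTITIONED;
    }

    public static void main(String[] args) {
        // Assumed stand-in for the MAX_LONG estimate used when stats are missing.
        long unknown = Long.MAX_VALUE;
        System.out.println(currentChoice(DistributionMode.NONE, unknown, unknown));
        System.out.println(proposedChoice(DistributionMode.NONE, unknown, unknown));
    }
}
```

In this sketch an explicit hint still overrides the cost comparison in both versions; only the no-hint, equal-cost case changes.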
Issue Links
- relates to: IMPALA-5381 Add query option to control join strategy when tables have no stats (Resolved)