-
Type:
Improvement
-
Status: Open
-
Priority:
Major
-
Resolution: Unresolved
-
Affects Version/s: 1.15.0
-
Fix Version/s: None
-
Component/s: Execution - Relational Operators
-
Labels:None
When the probe-side data for a hash join is skewed, it is preferable to have the corresponding partition on the build side to be in memory.
Currently, with the spill-to-disk feature, the partition selected for spilling to disk is done at random. This means that a highly skewed probe-side data would also spill for lack of a corresponding hash table partition in memory.
- is related to
-
DRILL-6444 Hash Join: Avoid partitioning when memory is sufficient
-
- Open
-