Details
-
Improvement
-
Status: In Progress
-
Minor
-
Resolution: Unresolved
-
3.1.0
-
None
-
None
Description
SortMergeJoin with partial hash distribution can be optimized to remove shuffle if the hash partitioning expressions are a subset of join keys for both sides.
Attachments
Issue Links
- relates to
-
SPARK-18067 SortMergeJoin adds shuffle if join predicates have non partitioned columns
- Resolved
- links to