Description
Currently sort merge join only supports inner join type (https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/joins/SortMergeJoinExec.scala#L374 ). We added code-gen for other join types internally in our fork and saw obvious CPU performance improvement. Create this Jira to propose to merge back to upstream.
Attachments
1.
|
Refactor sort merge join code-gen be agnostic to join type | Resolved | Cheng Su | |
2.
|
Add code-gen for left/right outer sort merge join | Resolved | Cheng Su | |
3.
|
Add code-gen for left semi sort merge join | Resolved | Cheng Su | |
4.
|
Add code-gen for left anti sort merge join | Resolved | Cheng Su | |
5.
|
Add code-gen for full outer sort merge join | Resolved | Cheng Su | |
6.
|
Add code-gen for existence sort merge join | Resolved | Cheng Su |