Description
Under the default configuration (spark.sql.crossJoin.enabled = false), require explicit CROSS JOIN syntax in SQL (or the new crossJoin DataFrame API) to specify a cartesian product between relations.
By cartesian product we mean a join between relations R and S where there is no join condition involving columns from both R and S.
If a cartesian product is detected in the absence of an explicit CROSS join, an error must be thrown.
Turning on the spark.sql.crossJoin.enabled configuration flag will disable this check and allow cartesian products without an explicit cross join.
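The rule described above can be sketched in plain Python. This is an illustration only, not Spark's implementation: `Relation`, `CartesianProductError`, `is_cartesian_product`, and `check_join` are hypothetical names, column names are assumed to be unique across relations (for example `r.id` and `s.id`), and the join condition is assumed to be already split into conjuncts, each represented by the set of columns it references.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, Set


@dataclass(frozen=True)
class Relation:
    """Hypothetical stand-in for a relation: a name plus its column names."""
    name: str
    columns: FrozenSet[str]


class CartesianProductError(Exception):
    """Hypothetical error type used only in this sketch."""


def is_cartesian_product(left: Relation, right: Relation,
                         conjuncts: Iterable[Set[str]]) -> bool:
    """Return True if no conjunct references columns from both relations.

    Each element of `conjuncts` is the set of column names that one
    conjunct of the join condition refers to (an assumption of this sketch).
    """
    for cols in conjuncts:
        touches_left = bool(set(cols) & left.columns)
        touches_right = bool(set(cols) & right.columns)
        if touches_left and touches_right:
            return False  # a real join condition links the two sides
    return True


def check_join(left: Relation, right: Relation,
               conjuncts: Iterable[Set[str]],
               explicit_cross: bool = False,
               cross_join_enabled: bool = False) -> None:
    """Raise unless the cartesian product is explicit or the flag is on.

    `cross_join_enabled` models spark.sql.crossJoin.enabled (default false).
    """
    if explicit_cross or cross_join_enabled:
        return
    if is_cartesian_product(left, right, conjuncts):
        raise CartesianProductError(
            f"Cartesian product detected between {left.name} and {right.name}; "
            "use an explicit CROSS JOIN or set spark.sql.crossJoin.enabled = true"
        )
```

With `R(r.id, r.a)` and `S(s.id, s.b)`, a condition such as `r.id = s.id` passes the check, while `r.a = 1 AND s.b = 2` is still treated as a cartesian product because neither conjunct links the two relations.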
Issue Links
- is related to
  - SPARK-14854 Left outer join produces incorrect output when the join condition does not have left table key (Resolved)
  - SPARK-18390 Optimized plan tried to use Cartesian join when it is not enabled (Resolved)
- relates to
  - SPARK-18013 R cross join API similar to python and Scala (Resolved)
- links to
1. R cross join API similar to python and Scala | Resolved | Felix Cheung
2. Python crossJoin API similar to Scala | Resolved | Srinath