Spark / SPARK-17298

Require explicit CROSS join for cartesian products by default

Details

    • Type: Story
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.1.0
    • Component/s: SQL
    • Labels: None

    Description

      Require the CROSS join syntax in SQL (and a new crossJoin DataFrame API) to specify cartesian products between relations explicitly under the default configuration (spark.sql.crossJoin.enabled = false).
      Here, a cartesian product means a join between relations R and S with no join condition that involves columns from both R and S.

      If a cartesian product is detected without an explicit CROSS join, an error must be thrown.
      Setting spark.sql.crossJoin.enabled to true disables this check and allows cartesian products without an explicit cross join.
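
The rule described above can be modelled in a few lines of plain Python. This is only an illustrative sketch of the intended behaviour, not Spark's implementation: the names Condition, is_cartesian, check_join and CartesianProductError are hypothetical, and a join condition is assumed to be reduced to the set of qualified column names (such as "R.id") that it references.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable


@dataclass(frozen=True)
class Condition:
    """Hypothetical join predicate, reduced to the qualified columns it uses."""
    columns: FrozenSet[str]  # e.g. frozenset({"R.id", "S.id"})


class CartesianProductError(Exception):
    """Hypothetical error for an implicit cartesian product."""


def references(cond: Condition, relation: str) -> bool:
    # True if the predicate touches at least one column of `relation`.
    return any(col.split(".", 1)[0] == relation for col in cond.columns)


def is_cartesian(left: str, right: str, conditions: Iterable[Condition]) -> bool:
    # A join is a cartesian product when no condition involves
    # columns from both sides of the join.
    return not any(
        references(c, left) and references(c, right) for c in conditions
    )


def check_join(left: str, right: str, conditions: Iterable[Condition],
               explicit_cross: bool = False,
               cross_join_enabled: bool = False) -> None:
    # `cross_join_enabled` stands in for spark.sql.crossJoin.enabled,
    # which is false by default.
    conditions = list(conditions)
    if not is_cartesian(left, right, conditions):
        return
    if explicit_cross or cross_join_enabled:
        return
    raise CartesianProductError(
        f"cartesian product between {left} and {right}: "
        "use an explicit CROSS join or enable spark.sql.crossJoin.enabled"
    )
```

Under this model, check_join("R", "S", []) raises the error, while the same call with explicit_cross=True or cross_join_enabled=True returns normally. A predicate that only filters one side, such as Condition(frozenset({"R.a"})), does not stop the join from counting as a cartesian product.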

            People

              Assignee: vssrinath Srinath
              Reporter: vssrinath Srinath