Spark / SPARK-11705

Eliminate unnecessary Cartesian Join

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Cannot Reproduce
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: SQL
    • Labels: None

    Description

      When we have queries similar to the following (I don't remember the exact form):
      select * from a, b, c, d where a.key1 = c.key1 and b.key2 = c.key2 and c.key3 = d.key3

      There will be a Cartesian join between a and b, because no join condition relates the two tables directly. But if we simply change the table order, for example to from a, c, b, d, the Cartesian join is eliminated.
      Without such manual tuning, the query will effectively never finish if a and b are big. We should not rely on such manual optimization.
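
The effect of table order can be illustrated with a small sketch. It is not Spark's optimizer code. It assumes a simplified planner that joins tables strictly left to right in FROM order, and it models each equi-join condition from the example query as a pair of table names. The names PREDICATES, connected, cartesian_steps and reorder are hypothetical and exist only for this illustration.

```python
from typing import List, Sequence, Set, Tuple

# Equi-join conditions of the example query, reduced to pairs of table names:
# a.key1 = c.key1, b.key2 = c.key2, c.key3 = d.key3
PREDICATES: List[Tuple[str, str]] = [("a", "c"), ("b", "c"), ("c", "d")]


def connected(table: str, joined: Set[str],
              predicates: Sequence[Tuple[str, str]]) -> bool:
    """True if some predicate links `table` to an already joined table."""
    return any(
        (x == table and y in joined) or (y == table and x in joined)
        for x, y in predicates
    )


def cartesian_steps(order: Sequence[str],
                    predicates: Sequence[Tuple[str, str]]) -> List[str]:
    """Join tables left to right in `order` and return the tables that had
    to be added without any applicable predicate (a Cartesian product)."""
    joined = {order[0]}
    bad = []
    for table in order[1:]:
        if not connected(table, joined, predicates):
            bad.append(table)
        joined.add(table)
    return bad


def reorder(order: Sequence[str],
            predicates: Sequence[Tuple[str, str]]) -> List[str]:
    """Greedy reordering (an assumption, not Spark's algorithm): at each step
    take the first remaining table that is connected to the joined set, and
    fall back to FROM order only when no such table exists."""
    remaining = list(order[1:])
    result = [order[0]]
    joined = {order[0]}
    while remaining:
        pick = next(
            (t for t in remaining if connected(t, joined, predicates)),
            remaining[0],
        )
        remaining.remove(pick)
        result.append(pick)
        joined.add(pick)
    return result


print(cartesian_steps(["a", "b", "c", "d"], PREDICATES))  # b has no link to a
print(cartesian_steps(["a", "c", "b", "d"], PREDICATES))  # every step has a predicate
print(reorder(["a", "b", "c", "d"], PREDICATES))
```

Under these assumptions, the original order a, b, c, d forces one predicate-free step when b is added, while the greedy pass arrives at the same a, c, b, d order that the manual rewrite uses.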

          People

            Assignee: Unassigned
            Reporter: Zhan Zhang (zzhan)
            Votes: 0
            Watchers: 13
