Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-39172

Remove outer join if all output come from streamed side and buffered side keys exist unique key

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.4.0
    • 3.4.0
    • SQL
    • None

    Description

      Improve the optimzation case using the distinct keys framework.

      For example:

      SELECT t1.* FROM t1 LEFT JOIN (SELECT distinct c1 as c1 FROM t)t2 ON t1.c1 = t2.c1
      ==>
      SELECT t1.* FROM t1 

      Attachments

        Activity

          People

            ulysses XiDuo You
            ulysses XiDuo You
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: