Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-32461 Shuffled hash join improvement
  3. SPARK-36612

Support left outer join build left or right outer join build right in shuffled hash join

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.2.0
    • 3.5.0
    • SQL
    • None

    Description

      Currently spark sql does not support build left side when left outer join (or build right side when right outer join).

      However, in our production environment, there are a large number of scenarios where small tables are left join large tables, and many times, large tables have data skew (currently AQE can't handle this kind of skew).

      Inspired by SPARK-32399, we can use similar ideas to realize left outer join build left.

      I think this treatment is very meaningful, but I don’t know how members consider this matter?

      Attachments

        Activity

          People

            szehon Szehon Ho
            mcdull_zhang mcdull_zhang
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: