Issue: SPARK-48012 (Spark; sub-task of SPARK-37375 Umbrella: Storage Partitioned Join (SPJ))

SPJ: Support Transform Expressions for One Side Shuffle

Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.4.3
    • Fix Version/s: None
    • Component/s: SQL

    Description

      SPARK-41471 allowed Spark to shuffle only one side of a join and still perform SPJ when the other side reports KeyGroupedPartitioning. However, that support covers only a KeyGroupedPartitioning without any partition transform (for example day, year, or bucket). It would be useful to support partition transforms as well, since many tables are partitioned by them.
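
To make the request concrete, here is a minimal conceptual sketch in plain Python (not Spark code). It assumes one side is already grouped in storage by a transform of the join key, and it "shuffles" only the other side by applying the same transform. The function names (`days`, `years`, `bucket`, `group_by_transform`, `one_side_shuffle_join`) are hypothetical stand-ins for illustration, and the modulo-based `bucket` is an assumption; real connectors define their own transform semantics.

```python
from collections import defaultdict
from datetime import datetime


# Hypothetical stand-ins for partition transforms (not Spark APIs).
def years(ts):
    return ts.year


def days(ts):
    return ts.date()


def bucket(num_buckets):
    # Assumption: plain modulo on an integer key; real bucket transforms hash the key.
    def apply(key):
        return key % num_buckets
    return apply


def group_by_transform(rows, key_fn, transform):
    """Group rows by transform(key), mimicking a layout keyed on a partition transform."""
    groups = defaultdict(list)
    for row in rows:
        groups[transform(key_fn(row))].append(row)
    return dict(groups)


def one_side_shuffle_join(partitioned_side, other_rows, key_fn, transform):
    """Join without moving the already-partitioned side.

    partitioned_side: dict of transformed partition value -> rows (as laid out in storage).
    other_rows: flat list of rows; only this side is regrouped ("shuffled").
    """
    shuffled = group_by_transform(other_rows, key_fn, transform)
    joined = []
    for part_value, left_rows in partitioned_side.items():
        # Only rows that landed in the same transformed partition can match.
        for left in left_rows:
            for right in shuffled.get(part_value, []):
                if key_fn(left) == key_fn(right):
                    joined.append((left, right))
    return joined


if __name__ == "__main__":
    key = lambda row: row[0]
    t1 = datetime(2024, 1, 1, 10)
    t2 = datetime(2024, 1, 2, 9)
    stored = group_by_transform([(t1, "a"), (t2, "b")], key, days)
    print(one_side_shuffle_join(stored, [(t1, "x")], key, days))
```

The point of the sketch is that the shuffled side must be regrouped with the same transform (here `days`) that the storage-partitioned side reports; grouping on the raw key alone would not line up with the existing partitions.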

            People

              Assignee: Unassigned
              Reporter: Szehon Ho (szehon)
              Votes: 0
              Watchers: 1

              Dates

                Created:
                Updated: