Step 1 of 4: Choose Issues

Cancel

T Patch Info Key Summary Assignee Reporter P Status Resolution Created Updated Due Development
Sub-task SPARK-48065

SPARK-37375 SPJ: allowJoinKeysSubsetOfPartitionKeys is too strict

Szehon Ho Szehon Ho Major Resolved Fixed  
Sub-task SPARK-48012

SPARK-37375 SPJ: Support Transfrom Expressions for One Side Shuffle

Szehon Ho Szehon Ho Major Resolved Fixed  
Sub-task SPARK-47612

SPARK-37375 Improve picking the side of partially clustered distribution accroding to partition size

Unassigned Qi Zhu Major Open Unresolved  
Sub-task SPARK-47094

SPARK-37375 SPJ : Dynamically rebalance number of buckets when they are not equal

Szehon Ho Himadri Pal Major Resolved Fixed  
Sub-task SPARK-45652

SPARK-37375 SPJ: Handle empty input partitions after dynamic filtering

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-45036

SPARK-37375 SPJ: Refactor logic to handle partially clustered distribution

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-44659

SPARK-37375 SPJ: Include keyGroupedPartitioning in StoragePartitionJoinParams equality check

Unassigned Chao Sun Minor Open Unresolved  
Sub-task SPARK-44647

SPARK-37375 SPJ: Support SPJ when join key is subset of partition keys

Szehon Ho Szehon Ho Major Resolved Fixed  
Sub-task SPARK-44641

SPARK-37375 SPJ: Results duplicated when SPJ partial-cluster and pushdown enabled but conditions unmet

Chao Sun Szehon Ho Blocker Resolved Fixed  
Sub-task SPARK-42454

SPARK-37375 SPJ: encapsulate all SPJ related parameters in BatchScanExec

Szehon Ho Chao Sun Minor Resolved Fixed  
Sub-task SPARK-42040

SPARK-37375 SPJ: Introduce a new API for V2 input partition to report partition size

Qi Zhu Chao Sun Major Resolved Fixed  
Sub-task SPARK-42039

SPARK-37375 SPJ: Remove Option in KeyGroupedPartitioning#partitionValues

Chao Sun Chao Sun Minor Resolved Fixed  
Sub-task SPARK-42038

SPARK-37375 SPJ: Support partially clustered distribution

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-41471

SPARK-37375 SPJ: Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning

Jia Fan Chao Sun Major Resolved Fixed  
Sub-task SPARK-41470

SPARK-37375 SPJ: Spark shouldn't assume InternalRow implements equals and hashCode

Mars Chao Sun Major Resolved Fixed  
Sub-task SPARK-41413

SPARK-37375 SPJ: Avoid shuffle when partition keys mismatch, but join expressions are compatible

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-41398

SPARK-37375 SPJ: Relax constraints on Storage-Partitioned Join when partition keys after runtime filtering do not match

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-40946

SPARK-37375 SPJ: Introduce a new DataSource V2 interface SupportsPushDownClusterKeys

Unassigned Huaxin Gao Major In Progress Unresolved  
Sub-task SPARK-37378

SPARK-37375 SPJ: Convert V2 Transform expressions into catalyst expressions and load their associated functions from V2 FunctionCatalog

Unassigned Chao Sun Major Resolved Duplicate  
Sub-task SPARK-37377

SPARK-37375 SPJ: Initial implementation of Storage-Partitioned Join

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-37376

SPARK-37375 SPJ: Introduce a new DataSource V2 interface HasPartitionKey

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-37166

SPARK-37375 SPIP: Storage Partitioned Join

Chao Sun Chao Sun Major Resolved Fixed  

Cancel