

Type Patch Info Key Summary Assignee Reporter Priority Status Resolution Created Updated Due Development
Improvement SPARK-47847

Deprecate spark.network.remoteReadNioBufferConversion

Unassigned Cheng Pan Major Open Unresolved  
Improvement SPARK-47835

Remove switch for remoteReadNioBufferConversion

Unassigned Cheng Pan Major Open Unresolved  
Bug SPARK-47678

Fetch-failed exception when a new executor reuses the IP address of a previously killed executor

Unassigned BoYang Critical Open Unresolved  
Bug SPARK-47148

Avoid materializing AQE ExchangeQueryStageExec on cancellation

Unassigned Eren Avsarogullari Major Open Unresolved  
Bug SPARK-46706

percentile_approx regression since Spark 2.4

Unassigned Joey Pereira Major Open Unresolved  
Improvement SPARK-46512

Optimize shuffle reading when both sort and combine are used.

Chenyu Zheng Chenyu Zheng Minor Resolved Fixed  
Improvement SPARK-46319

Implements OffHeapRecyclingBufferPool for ZSTD Compression Buffer Pool

Unassigned Kent Yao Major Resolved Later  
Improvement SPARK-46256

Parallel Compression Support for ZSTD

Kent Yao Kent Yao Major Resolved Fixed  
Bug SPARK-45134

Data duplication may occur when falling back to the original shuffle block

Unassigned gaoyajun02 Critical Open Unresolved  
Epic SPARK-44937

Add SSL/TLS support for RPC and Shuffle communications

Hasnain Lakhani Hasnain Lakhani Major Resolved Fixed  
Bug SPARK-44772

Reading blocks from remote executors causes timeout issue

Unassigned nebi mert aydin Major Resolved Cannot Reproduce  
Bug SPARK-44756

Executor hangs when RetryingBlockTransferor fails to initiate retry

Harunobu Daikoku Harunobu Daikoku Minor Resolved Fixed  
New Feature SPARK-44526

Porting k8s PVC reuse logic to spark standalone

Unassigned Faiz Halde Major Open Unresolved  
Bug SPARK-44215

Client receives zero number of chunks in merge meta response which doesn't trigger fallback to unmerged blocks

Chandni Singh Chandni Singh Major Resolved Fixed  
New Feature SPARK-44209

Expose amount of shuffle data available on the node

Unassigned Deependra Patel Trivial Open Unresolved  
Bug SPARK-44019

Unable to deserialize broadcasted map statuses when executor decommissioned

Unassigned David Klinberg Minor Resolved Duplicate  
Improvement SPARK-43987

Separate finalizeShuffleMerge Processing to Dedicated Thread Pools

SHU WANG SHU WANG Critical Resolved Fixed  
Improvement SPARK-43583

When encryption is enabled on the External Shuffle Service, processing of push meta requests throws an NPE

Chandni Singh Chandni Singh Major Resolved Fixed  
Question SPARK-43408

Spark caching in the context of a single job

Unassigned Faiz Halde Trivial Resolved Invalid  
Improvement SPARK-43179

Add option for applications to control saving of metadata in the External Shuffle Service LevelDB

Chandni Singh Chandni Singh Major Resolved Fixed  
Bug SPARK-42834

Division by zero occurs in PushBasedFetchHelper.createChunkBlockInfosFromMetaResponse

Unassigned Li Ying Major Closed Won't Do  
Bug SPARK-42784

Fix the problem of incomplete creation of subdirectories in push merged localDir

Fencheng Mei Fencheng Mei Major Resolved Fixed  
Improvement SPARK-42366

Log the diagnosed cause of shuffle data corruption

dzcxzl dzcxzl Minor Resolved Fixed  
Sub-task SPARK-42203

SPARK-33235 JsonProtocol should skip logging of push-based shuffle read metrics when push-based shuffle is disabled

Unassigned Josh Rosen Major Open Unresolved  
Bug SPARK-41792

Shuffle merge finalization removes the wrong finalization state from the DB

Mridul Muralidharan Mridul Muralidharan Minor Resolved Fixed  
Improvement SPARK-41541

Fix wrong child call in SQLShuffleWriteMetricsReporter.decRecordsWritten()

Josh Rosen Josh Rosen Major Resolved Fixed  
Bug SPARK-41458

Correctly transform the SPI services for Yarn Shuffle Service

Cheng Pan Cheng Pan Minor Resolved Fixed  
Task SPARK-41415

SASL Request Retries

Aravind Patnam Aravind Patnam Major Resolved Fixed  
Improvement SPARK-41089

Relocate Netty native arm64 libs

Cheng Pan Cheng Pan Minor Resolved Fixed  
Sub-task SPARK-40872

SPARK-33235 Fallback to original shuffle block when a push-merged shuffle chunk is zero-size

gaoyajun02 gaoyajun02 Major Resolved Fixed  
Bug SPARK-40499

Spark 3.2.1 percentile_approx query much slower than Spark 2.4.0

Unassigned xuanzhiang Major Resolved Invalid  
Bug SPARK-40480

Remove push-based shuffle data after query finished

Wan Kun Wan Kun Major Resolved Fixed  
Bug SPARK-40412

limit(x,y) + subquery causes data loss and out-of-order results

Unassigned FengJia Major Resolved Invalid  
Improvement SPARK-40083

Add shuffle index cache expire time policy to avoid unused continuous memory consumption

Unassigned wangshengjie Major In Progress Unresolved  
Improvement SPARK-39661

Avoid creating unnecessary SLF4J Logger

BingKun Pan BingKun Pan Trivial Resolved Fixed  
Bug SPARK-39647

Block push fails with java.lang.IllegalArgumentException: Active local dirs list has not been updated by any executor registration even when the NodeManager hasn't been restarted

Chandni Singh Chandni Singh Major Resolved Fixed  
Bug SPARK-39569

Spark shuffle index cache ignores the weight of the index path

Unassigned chen zhejia Major Open Unresolved  
Sub-task SPARK-39200

SPARK-20624 Stream is corrupted Exception while fetching the blocks from fallback storage system

Frank Yin Rajendra Gujja Major Resolved Fixed  
Improvement SPARK-39080

Optimize shuffle error handler

Unassigned Wan Kun Minor In Progress Unresolved  
Improvement SPARK-39072

Fast Fail the remaining push blocks if shuffle stage finalized

Unassigned Wan Kun Minor In Progress Unresolved  
Sub-task SPARK-38987

SPARK-30602 Handle fallback when merged shuffle blocks are corrupted and spark.shuffle.detectCorrupt is set to true

Aravind Patnam Ye Zhou Major Resolved Fixed  
Bug SPARK-38973

When push-based shuffle is enabled, a stage may not complete when retried

Chandni Singh Chandni Singh Major Resolved Fixed  
Bug SPARK-38965

Optimize RemoteBlockPushResolver with a memory pool

Unassigned Wan Kun Minor In Progress Unresolved  
Bug SPARK-38856

Fix a RejectedExecutionException error when push-based shuffle is enabled

weixiuli weixiuli Major Resolved Fixed  
Improvement SPARK-38805

Remove an expired indexFilePath from the ESS shuffleIndexCache or the PBS indexCache to save memory.

Unassigned weixiuli Minor In Progress Unresolved  
Improvement SPARK-38683

It is unnecessary to release the ShuffleManagedBufferIterator or ShuffleChunkManagedBufferIterator or ManagedBufferIterator buffers when the client channel's connection is terminated

weixiuli weixiuli Major Resolved Fixed  
Improvement SPARK-38555

Avoid contention and get or create clientPools quickly in the TransportClientFactory

weixiuli weixiuli Major Resolved Fixed  
Improvement SPARK-38428

Check the FetchShuffleBlocks message only once to improve iteration in external shuffle service

Apache Spark weixiuli Major Resolved Fixed  
Bug SPARK-38344

Avoid submitting tasks when there are no requests to push in push-based shuffle

weixiuli weixiuli Major Resolved Fixed  
Bug SPARK-38137

Repartition + shuffle + non-deterministic function leads to incorrect results

Unassigned Jakub Leś Major Open Unresolved  
