[SPARK-15690] Fast single-node (single-process) in-memory shuffle - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: None
Fix Version/s: None
Component/s: Shuffle, Spark Core, SQL
Labels:
- bulk-closed

Description

Spark's current shuffle implementation sorts all intermediate data by their partition id, and then write the data to disk. This is not a big bottleneck because the network throughput on commodity clusters tend to be low. However, an increasing number of Spark users are using the system to process data on a single-node. When in a single node operating against intermediate data that fits in memory, the existing shuffle code path can become a big bottleneck.

The goal of this ticket is to change Spark so it can use in-memory radix sort to do data shuffling on a single node, and still gracefully fallback to disk if the data size does not fit in memory. Given the number of partitions is usually small (say less than 256), it'd require only a single pass do to the radix sort with pretty decent CPU efficiency.

Note that there have been many in-memory shuffle attempts in the past. This ticket has a smaller scope (single-process), and aims to actually productionize this code.

Attachments

Issue Links

relates to

SPARK-3376 Memory-based shuffle strategy to reduce overhead of disk I/O

Resolved

Activity

People

Assignee:: Unassigned

Reporter:: Reynold Xin

Votes:: 7 Vote for this issue

Watchers:: 28 Start watching this issue

Dates

Created:: 01/Jun/16 05:43

Updated:: 17/May/20 18:30

Resolved:: 08/Oct/19 05:41