SPARK-3000

Drop old blocks to disk in parallel when memory is not large enough for caching new blocks


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Auto Closed
    • Affects Version/s: 1.1.0
    • Fix Version/s: None
    • Component/s: Block Manager, Spark Core

    Description

      In Spark, an RDD can be cached in memory for later use. The memory available for caching is "spark.executor.memory * spark.storage.memoryFraction" for Spark versions before 1.1.0, and "spark.executor.memory * spark.storage.memoryFraction * spark.storage.safetyFraction" after SPARK-1777.
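
      For example, with the Spark 1.x defaults of spark.storage.memoryFraction = 0.6 and spark.storage.safetyFraction = 0.9, an executor with 10 GB of memory can use about 10 GB * 0.6 * 0.9 = 5.4 GB for caching.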

      For storage level MEMORY_AND_DISK, when free memory is not enough to cache new blocks, old blocks may be dropped to disk to free up memory for the new ones. This is handled by ensureFreeSpace in MemoryStore.scala, and the caller always holds an "accountingLock" to ensure that only one thread drops blocks at a time. Because the drops are serialized, this method cannot fully utilize the disk throughput when the worker node has multiple disks. When testing our workload, we found it to be a real bottleneck whenever the total size of the old blocks to be dropped is large. A minimal sketch of the idea is shown below.
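
      A minimal sketch, assuming hypothetical helpers selectBlocksToDrop and dropToDisk that stand in for the real MemoryStore/BlockManager internals: the victim blocks are still chosen serially under accountingLock so that memory accounting stays consistent, but the disk writes themselves run on a small thread pool so that multiple disks can be driven at once.

        import java.util.concurrent.Executors
        import scala.concurrent.{Await, ExecutionContext, Future}
        import scala.concurrent.duration.Duration

        object ParallelDropSketch {
          // Assumption: one spill thread per local disk, so that all disks
          // can be written concurrently. The count is illustrative.
          private val numSpillThreads = 4
          private implicit val spillContext: ExecutionContext =
            ExecutionContext.fromExecutorService(
              Executors.newFixedThreadPool(numSpillThreads))

          def ensureFreeSpace(space: Long, accountingLock: AnyRef): Unit = {
            // Phase 1 (serial): choose which blocks to evict while holding
            // the lock, so that memory accounting stays consistent.
            val victims: Seq[String] = accountingLock.synchronized {
              selectBlocksToDrop(space)
            }
            // Phase 2 (parallel): spill the chosen blocks without holding
            // the lock, spreading the writes over multiple disks.
            val drops = victims.map(id => Future { dropToDisk(id) })
            Await.result(Future.sequence(drops), Duration.Inf)
          }

          // Hypothetical placeholders for the real MemoryStore / BlockManager
          // operations; the actual patch would call into those components.
          private def selectBlocksToDrop(space: Long): Seq[String] = Seq.empty
          private def dropToDisk(blockId: String): Unit = ()
        }

      The key point of the sketch is only that block selection must stay under the lock while the disk writes need not be; the real change would also have to guard against a block being selected by two concurrent callers.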

      We have tested a parallel approach on Spark 1.0 and the speedup is significant, so it is worth making the block-dropping operation run in parallel.

      Attachments

        1. Spark-3000 Design Doc.pdf (710 kB, uploaded by Zhang, Liye)



            People

              Assignee: joshrosen Josh Rosen
              Reporter: liyezhang556520 Zhang, Liye
              Votes: 0
              Watchers: 7

              Dates

                Created:
                Updated:
                Resolved: