Details
Type: Story
Status: Resolved
Priority: Major
Resolution: Fixed
Description
Memory management in Spark is currently broken down into two disjoint regions: one for execution and one for storage. The sizes of these regions are statically configured and fixed for the duration of the application.
There are several limitations to this approach. It requires user expertise to avoid unnecessary spilling, and there are no sensible defaults that will work for all workloads. As a Spark user, I want Spark to manage the memory more intelligently so I do not need to worry about how to statically partition the execution (shuffle) memory fraction and cache memory fraction. More importantly, applications that do not use caching use only a small fraction of the heap space, resulting in suboptimal performance.
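For context, the static split described above is what users have had to tune by hand. A minimal sketch, assuming a pre-unified (Spark 1.x-era) setup in which spark.shuffle.memoryFraction and spark.storage.memoryFraction carve the heap into fixed regions (the fractions shown are the historical defaults):

import org.apache.spark.{SparkConf, SparkContext}

// Legacy static split: each region's size is fixed for the lifetime of the
// application, regardless of whether the workload actually caches anything.
// The fractions below are the historical defaults; avoiding unnecessary
// spills means hand-tuning them per workload.
val conf = new SparkConf()
  .setAppName("static-memory-split-example")
  .setMaster("local[*]")
  .set("spark.shuffle.memoryFraction", "0.2")  // execution (shuffle) region
  .set("spark.storage.memoryFraction", "0.6")  // storage (cache) region
val sc = new SparkContext(conf)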
Instead, we should unify these two regions and let one borrow from the other when possible.
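For intuition only, here is a minimal sketch of such a unified pool. The class and method names below are hypothetical illustrations, not Spark's actual MemoryManager API: execution requests may reclaim space by evicting cached blocks, while storage requests only consume memory that is currently free (otherwise the caller would spill or drop the block).

// Illustrative sketch of a single pool shared by execution and storage.
class UnifiedPoolSketch(val maxMemory: Long) {
  private var executionUsed = 0L
  private var storageUsed = 0L

  private def free: Long = maxMemory - executionUsed - storageUsed

  // Execution is granted free memory first; if that is insufficient, cached
  // blocks are evicted (modeled here by shrinking storageUsed).
  def acquireExecution(bytes: Long): Long = synchronized {
    val fromFree = math.min(bytes, free)
    val evicted = math.min(bytes - fromFree, storageUsed)
    storageUsed -= evicted               // evict cached blocks to make room
    val granted = fromFree + evicted
    executionUsed += granted
    granted                              // may be less than requested
  }

  // Storage may only use memory that is currently free; it never evicts
  // execution memory. On failure the caller would spill or drop the block.
  def acquireStorage(bytes: Long): Boolean = synchronized {
    if (bytes <= free) { storageUsed += bytes; true } else false
  }

  def releaseExecution(bytes: Long): Unit = synchronized {
    executionUsed -= math.min(bytes, executionUsed)
  }

  def releaseStorage(bytes: Long): Unit = synchronized {
    storageUsed -= math.min(bytes, storageUsed)
  }
}

A plausible reason for the asymmetry in this sketch is that an evicted cached block can be recomputed or re-read later, whereas taking memory away from a running task mid-computation is far more disruptive, so execution may displace storage but not the reverse.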
Sub-Tasks
1. Introduce common memory management interface for execution and storage | Resolved | Andrew Or
2. Implement unified memory manager | Resolved | Andrew Or
3. Simplify *MemoryManager class structure | Resolved | Josh Rosen
4. Ensure spilling tests are actually spilling | Resolved | Andrew Or
5. Document new memory management model | Resolved | Andrew Or