[SPARK-1991] Support custom StorageLevels for vertices and edges - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.1.0
Component/s: GraphX
Labels:
None

Description

Large graphs may not fit entirely in memory. If we supported custom storage levels for the vertices and edges of a graph, the user could specify MEMORY_AND_DISK and then repartition the graph to use many small partitions, each of which does fit in memory. Spark would then automatically load partitions from disk as needed.

Also, the replicated storage levels would be helpful for fault tolerance, and the serialized ones would improve efficiency for non-primitive vertex and edge attributes.

Attachments

Activity

People

Assignee:: Ankur Dave

Reporter:: Ankur Dave

Votes:: 1 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 02/Jun/14 02:56

Updated:: 03/Jun/14 21:57

Resolved:: 03/Jun/14 21:57