[HIVE-8844] Choose a persisent policy for RDD caching [Spark Branch] - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.1.0
Component/s: Spark
Labels:
None

Description

RDD caching is used for performance reasons in some multi-insert queries. Currently, we call RDD.cache(), which indicates a persistency policy of using memory only. We should choose a better policy. I think memory+disk will be good enough. Refer to RDD.persist() for more information.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-8844.3-spark.patch
15/Nov/14 17:06
2 kB
Jimmy Xiang
HIVE-8844.2-spark.patch
15/Nov/14 03:15
7 kB
Jimmy Xiang
HIVE-8844.1-spark.patch
15/Nov/14 00:30
7 kB
Jimmy Xiang

Activity

People

Assignee:: Jimmy Xiang

Reporter:: Xuefu Zhang

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 12/Nov/14 19:05

Updated:: 29/May/15 02:31

Resolved:: 15/Nov/14 19:46