SPARK-2650: Caching tables larger than memory causes OOMs


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 1.0.0, 1.0.1
    • Fix Version/s: 1.1.0
    • Component/s: SQL
    • Labels: None

    Description

      The logic for setting up the initial column buffers in Spark SQL differs from Shark's, and I'm seeing OOMs when caching tables that are larger than available memory (tables that Shark cached without problems).
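
      For context, a minimal sketch of the kind of job that hits this, assuming a table larger than the memory available for caching (the path, table name, and schema below are hypothetical; the APIs are Spark 1.0.x-era):

        import org.apache.spark.{SparkConf, SparkContext}
        import org.apache.spark.sql.SQLContext

        case class Record(key: Int, value: String)

        object CacheOomRepro {
          def main(args: Array[String]): Unit = {
            val sc = new SparkContext(new SparkConf().setAppName("CacheOomRepro"))
            val sqlContext = new SQLContext(sc)
            import sqlContext.createSchemaRDD  // implicit RDD -> SchemaRDD conversion

            // Assume this file is larger than the memory available for caching.
            val records = sc.textFile("hdfs:///data/big_table").map { line =>
              val cols = line.split('\t')
              Record(cols(0).toInt, cols(1))
            }
            records.registerAsTable("big_table")

            // Building the in-memory column buffers for the cached table is where
            // executors run out of memory (Shark handled the same data fine).
            sqlContext.cacheTable("big_table")
            sqlContext.sql("SELECT COUNT(*) FROM big_table").collect()
          }
        }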

      Two suspicious things: initialSize is always set to 0, so we always fall back to the default, and the default looks like it was copied from code like 10 * 1024 * 1024... but in Spark SQL it's 10 * 102 * 1024 (i.e., roughly 1 MB instead of 10 MB).
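
      A sketch of the suspected shape of that logic (the names here are my own reconstruction, not the actual Spark SQL source): a zero initialSize always selects the default, and the default constant differs from the likely intended 10 * 1024 * 1024:

        import java.nio.ByteBuffer

        // Hypothetical reconstruction of the column-buffer sizing described above.
        object ColumnBufferSizing {
          // Suspected typo: 10 * 102 * 1024 = 1,044,480 bytes (~1 MB),
          // while 10 * 1024 * 1024 = 10,485,760 bytes (10 MB) was likely intended.
          val DEFAULT_INITIAL_SIZE = 10 * 102 * 1024

          def initialBuffer(initialSize: Int): ByteBuffer = {
            // Callers currently always pass initialSize = 0, so every column
            // builder starts from the (mistyped) default rather than a size
            // estimated from the data.
            val size = if (initialSize == 0) DEFAULT_INITIAL_SIZE else initialSize
            ByteBuffer.allocate(size)
          }
        }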

People

    • Assignee: Michael Armbrust (marmbrus)
    • Reporter: Michael Armbrust (marmbrus)
    • Votes: 0
    • Watchers: 3
