Details
Description
If a coalesce transformation with a small number of output partitions (16 in my case) is applied to a large Parquet file (in my case about 150Gb with 215k partitions), it causes OutOfMemory exceptions (250Gb is not enough) and open file limit exhaustion (with the limit set to 8k).
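A minimal reproduction sketch of this scenario is shown below; the dataset paths and application name are illustrative placeholders, not taken from the original job:

import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

object CoalesceParquetRepro {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("coalesce-parquet-repro"))
    val sqlContext = new SQLContext(sc)

    // A Parquet dataset split across a very large number of files/partitions.
    val df = sqlContext.read.parquet("/data/large_parquet_dataset")

    // Coalescing to a small number of partitions means each task has to read
    // many underlying Parquet files before it completes.
    df.coalesce(16).write.parquet("/data/coalesced_output")
  }
}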
The source of the problem is in the SqlNewHadoopRDD.compute method:
val reader = format.createRecordReader(
  split.serializableHadoopSplit.value, hadoopAttemptContext)
reader.initialize(split.serializableHadoopSplit.value, hadoopAttemptContext)

// Register an on-task-completion callback to close the input stream.
context.addTaskCompletionListener(context => close())
The created Parquet file reader is intended to be closed at task completion time. This reader holds many references to parquet.bytes.BytesInput objects, which in turn reference large byte arrays (some of them several megabytes in size).
Since in the case of CoalescedRDD a task completes only after processing a large number of Parquet files, this causes file handle exhaustion and memory overflow.
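One possible mitigation is sketched below: close each per-file record reader as soon as that file's records are exhausted, instead of only relying on the task-completion callback. This is an illustrative wrapper around the Hadoop RecordReader API under that assumption, not the actual Spark code, and closeWhenExhausted is a hypothetical helper name:

import org.apache.hadoop.mapreduce.RecordReader

// Hypothetical sketch: wraps a per-file RecordReader so the underlying file
// handle and buffered bytes are released as soon as the file is fully read,
// rather than being held until the whole (coalesced) task finishes.
def closeWhenExhausted[K, V](reader: RecordReader[K, V]): Iterator[(K, V)] =
  new Iterator[(K, V)] {
    private var havePair = false
    private var finished = false

    override def hasNext: Boolean = {
      if (!finished && !havePair) {
        finished = !reader.nextKeyValue()
        if (finished) {
          // Close eagerly once this file is done, not at task completion.
          reader.close()
        }
        havePair = !finished
      }
      !finished
    }

    override def next(): (K, V) = {
      if (!hasNext) throw new NoSuchElementException("End of stream")
      havePair = false
      (reader.getCurrentKey, reader.getCurrentValue)
    }
  }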
Attachments
Issue Links