[SPARK-3056] Sort-based Aggregation - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.5.0
Component/s: SQL
Labels:
None

Description

Currently, SparkSQL only support the hash-based aggregation, which may cause OOM if too many identical keys in the input tuples.

Attachments

Issue Links

is blocked by

SPARK-2926 Add MR-style (merge-sort) SortShuffleReader for sort-based shuffle

Resolved

is duplicated by

SPARK-2873 Support disk spilling in Spark SQL aggregation

Resolved

links to

[Github] Pull Request #7458 (yhuai)

Activity

People

Assignee:: Yin Huai

Reporter:: Cheng Hao

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 15/Aug/14 00:54

Updated:: 28/Jul/15 20:08

Resolved:: 22/Jul/15 06:27