[SPARK-12686] Support group-by push down into data sources - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: 1.6.0
Fix Version/s: None
Component/s: SQL
Labels:
- bulk-closed

Description

For logical plans of the form 'Aggregate -> Project -> (Filter) -> Scan', we can push partial aggregation down into data sources that can aggregate their own data efficiently. For example, ORC and Parquet can return MIN/MAX values from their statistics, and some databases have efficient aggregation implementations.
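
As a rough illustration of the idea (not the code from the linked pull requests), the sketch below models a data source whose row groups carry min/max statistics, in the spirit of ORC/Parquet. All names here (`RowGroup`, `partial_min_max_from_stats`, `final_min_max`) are hypothetical and are not Spark or Parquet APIs. It assumes no filter is applied, since statistics alone cannot answer MIN/MAX for an arbitrary filtered subset of rows.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class RowGroup:
    """Hypothetical stand-in for a row group that stores column statistics."""
    values: List[int]
    min_stat: int
    max_stat: int


def make_row_group(values: List[int]) -> RowGroup:
    # Statistics are computed once, at write time.
    return RowGroup(values, min(values), max(values))


def partial_min_max_from_stats(group: RowGroup) -> Tuple[int, int]:
    # Pushed-down partial aggregation: answered from statistics only,
    # without reading the stored values.
    return group.min_stat, group.max_stat


def partial_min_max_by_scan(group: RowGroup) -> Tuple[int, int]:
    # Baseline without push down: every value is read and aggregated.
    return min(group.values), max(group.values)


def final_min_max(partials: List[Tuple[int, int]]) -> Tuple[int, int]:
    # Final aggregation on the engine side: merge the partial results.
    mins, maxs = zip(*partials)
    return min(mins), max(maxs)


groups = [make_row_group([5, 3, 9]), make_row_group([12, 7]), make_row_group([1, 8, 4])]
pushed = final_min_max([partial_min_max_from_stats(g) for g in groups])
scanned = final_min_max([partial_min_max_by_scan(g) for g in groups])
print(pushed, scanned)
```

Both paths produce the same result, but the pushed-down path hands only one (min, max) pair per row group to the final aggregation instead of every row.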

Issue Links

relates to

SPARK-12449 Pushing down arbitrary logical plans to data sources (Resolved)

links to

[Github] Pull Request #10631 (maropu)

[Github] Pull Request #18018 (kisimple)

People

Assignee: Unassigned
Reporter: Takeshi Yamamuro
Votes: 1
Watchers: 14

Dates

Created: 07/Jan/16 05:40
Updated: 21/May/19 04:12
Resolved: 21/May/19 04:12