[HIVE-28489] Partitioning the input data of Grouping Set GroupBy operator - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 4.1.0
Component/s: Physical Optimizer
Labels:
- hive-4.1.0-must
- pull-request-available

Target Version/s:

4.1.0

Description

GroupBy operator with grouping sets often emits too many rows, which becomes the bottleneck of query execution. To reduce the number output rows, this JIRA proposes partitioning the input data of such GroupBy operator.

Please check out the attached slides for detailed explanation.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

2.PartitionDataBeforeGroupingSet.pdf
31/Oct/24 01:21
505 kB
Seonggon Namgung

Issue Links

links to

GitHub Pull Request #5424

Activity

People

Assignee:: Seonggon Namgung

Reporter:: Seonggon Namgung

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 29/Aug/24 11:11

Updated:: 27/Nov/24 15:41

Resolved:: 27/Nov/24 15:41