[HIVE-2621] Allow multiple group bys with the same input data and spray keys to be run on the same reducer. - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.9.0
Component/s: None
Labels: None

Description

Currently, when a user runs a query such as a multi-insert, where each insertion subclause consists of a simple query followed by a group by, the group by for each subclause is run on a separate reducer. This requires writing the data for each group by clause to an intermediate file and then reading it back, which accounts for a significant share of the total CPU consumed by an otherwise simple query.

If the subclauses are grouped by their distinct expressions and group by keys, and all of the group by expressions for a group of subclauses are run on a single reducer, the amount of reading and writing to intermediate files would be reduced for some queries.
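The grouping step described above could be sketched roughly as follows. This is only an illustration in Python, not Hive's planner code; the `Subclause` type, its field names, and the `insert_N` labels are hypothetical.

```python
from collections import defaultdict
from typing import NamedTuple


class Subclause(NamedTuple):
    """Hypothetical stand-in for one insertion subclause of a multi-insert."""
    name: str
    group_by_keys: tuple
    distinct_exprs: tuple


def group_subclauses(subclauses):
    """Bucket subclauses that share both group by keys and distinct expressions.

    Each bucket is a candidate for being evaluated on a single reducer.
    """
    groups = defaultdict(list)
    for sc in subclauses:
        groups[(sc.group_by_keys, sc.distinct_exprs)].append(sc.name)
    return dict(groups)


subclauses = [
    Subclause("insert_1", ("key",), ()),
    Subclause("insert_2", ("key",), ()),
    Subclause("insert_3", ("value",), ()),
]
plan = group_subclauses(subclauses)
```

Here `insert_1` and `insert_2` land in the same bucket because they share group by keys and distinct expressions, while `insert_3` stays on its own.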

To do this, for each group of subclauses, the mapper would execute the filters for each subclause OR'd together (provided each subclause has a filter), followed by a reduce sink. In the reducer, the child operators would be each subclause's filter followed by its group by and any subsequent operations.
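As a rough illustration of that operator layout, the following Python sketch simulates one group of subclauses that all group by the same key and compute a count. It is not Hive code; the subclause names, the predicates, and the in-memory sort standing in for the shuffle are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical subclauses sharing the group by key row["key"].
# Each maps a subclause name to its filter predicate.
subclauses = {
    "small": lambda row: row["value"] < 10,
    "even": lambda row: row["value"] % 2 == 0,
}


def mapper(rows):
    """Emit a row if ANY subclause filter accepts it (the OR'd filters).

    This pre-filter is only valid when every subclause has a filter;
    otherwise all rows would have to be emitted.
    """
    for row in rows:
        if any(pred(row) for pred in subclauses.values()):
            yield row["key"], row  # reduce sink keyed on the group by key


def reducer(shuffled):
    """Re-apply each subclause's own filter, then run its group by (a count)."""
    results = {name: defaultdict(int) for name in subclauses}
    for key, row in shuffled:
        for name, pred in subclauses.items():
            if pred(row):
                results[name][key] += 1
    return {name: dict(counts) for name, counts in results.items()}


rows = [
    {"key": "a", "value": 2},
    {"key": "a", "value": 11},
    {"key": "b", "value": 12},
    {"key": "b", "value": 3},
]
# Sorting by key stands in for the shuffle between mapper and reducer.
emitted = sorted(mapper(rows), key=lambda kv: kv[0])
output = reducer(emitted)
```

The row with value 11 matches neither filter and is dropped in the mapper, so it is never written or shuffled. The remaining rows are read once by the reducer and fed to both group bys, rather than being written to one intermediate file per subclause.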

Note that this would require turning off map aggregation, so the use of this type of plan would need to be configurable.

Attachments

HIVE-2621.1.patch.txt, 02/Dec/11 02:35, 163 kB, Kevin Wilfong
ASF.LICENSE.NOT.GRANTED--HIVE-2621.D567.4.patch, 23/Dec/11 02:55, 457 kB, Phabricator
ASF.LICENSE.NOT.GRANTED--HIVE-2621.D567.3.patch, 22/Dec/11 01:27, 151 kB, Phabricator
ASF.LICENSE.NOT.GRANTED--HIVE-2621.D567.2.patch, 15/Dec/11 22:39, 162 kB, Phabricator
ASF.LICENSE.NOT.GRANTED--HIVE-2621.D567.1.patch, 02/Dec/11 02:36, 163 kB, Phabricator

Issue Links

relates to: HIVE-2056 Generate single MR job for multi groupby query if hive.multigroupby.singlemr is enabled. (Closed)

People

Assignee: Kevin Wilfong

Reporter: Kevin Wilfong

Votes: 0

Watchers: 4

Dates

Created: 02/Dec/11 02:13

Updated: 23/Apr/14 21:08

Resolved: 03/Jan/12 18:10