[HIVE-3552] HIVE-3552 performant manner for performing cubes/rollups/grouping sets for a high number of grouping set keys - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.11.0
Component/s: Query Processor
Labels:
None

Description

This is a follow up for ~~HIVE-3433~~.

Had a offline discussion with Sambavi - she pointed out a scenario where the
implementation in ~~HIVE-3433~~ will not scale. Assume that the user is performing
a cube on many columns, say '8' columns. So, each row would generate 256 rows
for the hash table, which may kill the current group by implementation.

A better implementation would be to add an additional mr job - in the first
mr job perform the group by assuming there was no cube. Add another mr job, where
you would perform the cube. The assumption is that the group by would have
decreased the output data significantly, and the rows would appear in the order of
grouping keys which has a higher probability of hitting the hash table.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

hive.3552.9.patch
19/Dec/12 08:07
226 kB
Namit Jain
hive.3552.8.patch
19/Dec/12 06:46
226 kB
Namit Jain
hive.3552.7.patch
19/Dec/12 04:33
221 kB
Namit Jain
hive.3552.6.patch
18/Dec/12 07:40
221 kB
Namit Jain
hive.3552.5.patch
18/Dec/12 05:07
221 kB
Namit Jain
hive.3552.4.patch
12/Dec/12 15:17
219 kB
Namit Jain
hive.3552.3.patch
12/Dec/12 10:08
219 kB
Namit Jain
hive.3552.2.patch
05/Dec/12 17:31
179 kB
Namit Jain
hive.3552.12.patch
03/Jan/13 16:44
226 kB
Namit Jain
hive.3552.11.patch
22/Dec/12 05:41
226 kB
Namit Jain
hive.3552.10.patch
20/Dec/12 04:25
226 kB
Namit Jain
hive.3552.1.patch
28/Nov/12 08:37
180 kB
Namit Jain

Issue Links

depends upon

HIVE-3433 Implement CUBE and ROLLUP operators in Hive

Closed

Activity

People

Assignee:: Namit Jain

Reporter:: Namit Jain

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 08/Oct/12 20:58

Updated:: 31/Jan/15 19:58

Resolved:: 09/Jan/13 17:59