[HIVE-4240] optimize hive.enforce.bucketing and hive.enforce sorting insert - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.11.0
Component/s: Query Processor
Labels:
None

Hadoop Flags:

Reviewed

Description

Consider the following scenario:

set hive.optimize.bucketmapjoin = true;
set hive.optimize.bucketmapjoin.sortedmerge = true;
set hive.input.format = org.apache.hadoop.hive.ql.io.BucketizedHiveInputFormat;
set hive.enforce.bucketing=true;
set hive.enforce.sorting=true;
set hive.exec.reducers.max = 1;
set hive.merge.mapfiles=false;
set hive.merge.mapredfiles=false;

– Create two bucketed and sorted tables
CREATE TABLE test_table1 (key INT, value STRING) PARTITIONED BY (ds STRING) CLUSTERED BY (key) SORTED BY (key) INTO 2 BUCKETS;
CREATE TABLE test_table2 (key INT, value STRING) PARTITIONED BY (ds STRING) CLUSTERED BY (key) SORTED BY (key) INTO 2 BUCKETS;

FROM src
INSERT OVERWRITE TABLE test_table1 PARTITION (ds = '1') SELECT *;

– Insert data into the bucketed table by selecting from another bucketed table
– This should be a map-only operation
INSERT OVERWRITE TABLE test_table2 PARTITION (ds = '1')
SELECT a.key, a.value FROM test_table1 a WHERE a.ds = '1';

We should not need a reducer to perform the above operation.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

hive.4240.1.patch
28/Mar/13 10:22
71 kB
Namit Jain
hive.4240.2.patch
29/Mar/13 11:09
174 kB
Namit Jain
hive.4240.3.patch
29/Mar/13 17:09
175 kB
Namit Jain
hive.4240.4.patch
31/Mar/13 16:10
175 kB
Namit Jain
hive.4240.5.patch
02/Apr/13 08:47
176 kB
Namit Jain
hive.4240.5.patch-nohcat
03/Apr/13 04:57
183 kB
Namit Jain

Issue Links

relates to

HIVE-4241 optimize hive.enforce.sorting and hive.enforce bucketing join

Closed

Activity

People

Assignee:: Namit Jain

Reporter:: Namit Jain

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 28/Mar/13 04:41

Updated:: 28/Mar/16 08:21

Resolved:: 03/Apr/13 07:05