Details
Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Reviewed
Description
Consider the following scenario:
T1: sorted and bucketed by key into 2 buckets
T2: sorted and bucketed by key into 2 buckets
T3: sorted and bucketed by key into 2 buckets
set hive.enforce.sorting=true;
set hive.enforce.bucketing=true;
insert overwrite table T3
select .. from T1 join T2 on T1.key = T2.key;
Since T1, T2 and T3 are all sorted and bucketed by the join key, and the
join above is performed as a sort-merge join, T3 should come out bucketed
and sorted without the need for an extra reducer.
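
To illustrate why no extra reducer should be needed, here is a minimal Python sketch of the scenario. It is a toy model, not Hive code: the names bucketize and merge_join, the sample rows, and the bucket assignment (integer key modulo the bucket count) are all assumptions made for the illustration. Each bucket of T1 is merge-joined with the matching bucket of T2, and the result is checked to be already sorted by key and already confined to the same bucket number, which is what T3 requires.

```python
# Toy model (assumption): a bucket is a list of (key, value) rows sorted by key,
# and a row with integer key k is assigned to bucket k % NUM_BUCKETS.
NUM_BUCKETS = 2


def bucketize(rows):
    """Split rows into NUM_BUCKETS lists, each sorted by key."""
    buckets = [[] for _ in range(NUM_BUCKETS)]
    for key, value in rows:
        buckets[key % NUM_BUCKETS].append((key, value))
    return [sorted(b) for b in buckets]


def merge_join(left, right):
    """Inner-join two key-sorted lists in a single forward pass."""
    out, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][0], right[j][0]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Locate the run of rows on the right that share this key.
            j_end = j
            while j_end < len(right) and right[j_end][0] == lk:
                j_end += 1
            # Pair every left row with that key against the whole right run.
            while i < len(left) and left[i][0] == lk:
                for r in range(j, j_end):
                    out.append((lk, left[i][1], right[r][1]))
                i += 1
            j = j_end
    return out


# Hypothetical sample data for T1 and T2.
t1 = bucketize([(1, "a"), (2, "b"), (3, "c"), (4, "d"), (3, "c2")])
t2 = bucketize([(2, "x"), (3, "y"), (4, "z"), (5, "w")])

# One map-side task per bucket pair; its output becomes bucket b of T3 directly.
t3 = [merge_join(t1[b], t2[b]) for b in range(NUM_BUCKETS)]

for b, rows in enumerate(t3):
    keys = [k for k, _, _ in rows]
    assert keys == sorted(keys)                      # output is sorted by key
    assert all(k % NUM_BUCKETS == b for k in keys)   # output stays in bucket b
```

In this model the join output for bucket b only ever contains keys from bucket b, in key order, so writing it straight to bucket b of T3 preserves both properties. This holds only because all three tables use the same key and the same number of buckets, as in the scenario above.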