Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
A query like:
select s.key, s.value from (
select key, value from src2 where key < 10
union all
select key, value from src3 where key < 10
union all
select key, value from src4 where key < 10
union all
select key, count(1) as value from src5 group by key
)s;
should run the last sub-query
'select key, count(1) as value from src5 group by key'
as a map-reduce job.
And then the union should be a map-only job reading from the first 3 map-only subqueries
and the output of the last map-reduce job.
The current plan is very inefficient.
Attachments
Attachments
Issue Links
- relates to
-
HIVE-2634 revert HIVE-2566
- Closed