Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.3.0, 0.6.0
-
None
Description
The query is:
explain INSERT OVERWRITE TABLE t
SELECT t1.r, t2.c, sum(t1.v * t2.v)
FROM t1 join t2 on t1.c = t2.r
GROUP BY t1.r, t2.c;
The FileSinkOperator after the join is serializing all 6 columns from the 2 tables (both have 3 columns: r, c, v) instead of the 4 that is needed later.
Attachments
Issue Links
- is blocked by
-
HIVE-460 Improve ColumnPruner to prune more aggressively and keep column information for input tables
- Closed