[HIVE-2329] Not using map aggregation, fails to execute group-by after cluster-by with same key - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.9.0
Component/s: Query Processor
Labels:
None

Hadoop Flags:

Reviewed

Description

hive.map.aggr=false
select Q1.key_int1, sum(Q1.key_int1), sum(distinct Q1.key_int1) from (select * from t1 cluster by key_int1) Q1 group by Q1.key_int1

resulted..

FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask

from hadoop logs..

Caused by: java.lang.RuntimeException: cannot find field key from []
at org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorUtils.getStandardStructFieldRef(ObjectInspectorUtils.java:321)
at org.apache.hadoop.hive.serde2.objectinspector.StandardStructObjectInspector.getStructFieldRef(StandardStructObjectInspector.java:119)
at org.apache.hadoop.hive.ql.exec.ExprNodeColumnEvaluator.initialize(ExprNodeColumnEvaluator.java:82)
at org.apache.hadoop.hive.ql.exec.GroupByOperator.initializeOp(GroupByOperator.java:198)
at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:357)
at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:433)
........

I think the problem is caused by ReduceSinkDeDuplication, removing RS which was providing rs.key for GBY operation. If child of child RS is a GBY, we should bypass the optimization.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

ASF.LICENSE.NOT.GRANTED--HIVE-2329.D657.1.patch
08/Dec/11 17:59
11 kB
Phabricator
HIVE-2329.1.patch.txt
07/Dec/11 08:13
3 kB
Navis Ryu

Issue Links

relates to

HIVE-2732 Reduce Sink deduplication fails if the child reduce sink is followed by a join

Closed

Activity

People

Assignee:: Navis Ryu

Reporter:: Navis Ryu

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Dates

Created:: 01/Aug/11 08:48

Updated:: 30/Apr/12 21:11

Resolved:: 09/Dec/11 17:58