[HIVE-7767] hive.optimize.union.remove does not work properly [Spark Branch] - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.1.0
Component/s: None
Labels:
None

Description

Turing on the hive.optimize.union.remove property generates wrong union all result.

For Example:

create table inputTbl1(key string, val string) stored as textfile;
load data local inpath '../../data/files/T1.txt' into table inputTbl1;
SELECT *
FROM (
  SELECT key, count(1) as values from inputTbl1 group by key
  UNION ALL
  SELECT key, count(1) as values from inputTbl1 group by key
) a;

when the hive.optimize.union.remove is turned on, the query result is like:

when the hive.optimize.union.remove is turned off, the query result is like:

The expected query result is:

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-7767.1-spark.patch
20/Aug/14 03:11
267 kB
Na Yang
HIVE-7767.2-spark.patch
20/Aug/14 04:57
278 kB
Na Yang
HIVE-7767.2-spark.patch
20/Aug/14 21:17
278 kB
Brock Noland
HIVE-7767.3-spark.patch
20/Aug/14 22:24
246 kB
Na Yang

Issue Links

Is contained by

HIVE-7292 Hive on Spark

Resolved

is related to

HIVE-7541 Support union all on Spark [Spark Branch]

Resolved

HIVE-7717 Add .q tests coverage for "union all" [Spark Branch]

Resolved

Activity

People

Assignee:: Na Yang

Reporter:: Na Yang

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 18/Aug/14 17:34

Updated:: 29/May/15 02:32

Resolved:: 21/Aug/14 00:09