[HIVE-15272] "LEFT OUTER JOIN" Is not populating correct records with Hive On Spark - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 1.1.0
Fix Version/s: None
Component/s: Hive, Spark
Labels:
None
Environment:

Hive 1.1.0, CentOS, Cloudera 5.7.4

Target Version/s:

2.0.2

Description

I ran following Hive query multiple times with execution engine as Hive on Spark and Hive on MapReduce.

SELECT COUNT(DISTINCT t1.region, t1.amount)
FROM my_db.my_table1 t1
LEFT OUTER
JOIN my_db.my_table2 t2 ON (t1.id = t2.id
                            AND t1.name = t2.name)

With Hive on Spark: Result (count) were different of every execution.
With Hive on MapReduce: Result (count) were same of every execution.

Seems like Hive on Spark behaving differently in each execution and does not populating correct result.

Attachments

Activity

People

Assignee:: Rui Li

Reporter:: Vikash Pareek

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 23/Nov/16 15:05

Updated:: 07/Apr/17 20:08