[HIVE-16241] When PTF, explode and join are used together, result is duplicated - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 2.1.0
Fix Version/s: None
Component/s: PTF-Windowing, Query Processor
Labels:
None

Description

Example is the bellow. Each subquery 'key' column is unique. But when they are joined on 'key' column, a result is duplicated.

CREATE TABLE test (
  key   STRING,
  type  STRING,
  value INT
) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t';

LOAD DATA LOCAL INPATH '/tmp/test.gz' OVERWRITE INTO TABLE test;

SELECT * FROM test;

A	type1	30000
B	type2	20000
C	type2	5000

SELECT l.*
FROM (
  SELECT * FROM test LATERAL VIEW explode(ARRAY(key)) e AS dammy
) l JOIN (
    SELECT key, rank() OVER (PARTITION BY type ORDER BY value DESC) rank 
    FROM test
) r ON l.key = r.key

A	type1	30000	A
A	type1	30000	A
B	type2	20000	B
B	type2	20000	B
C	type2	5000	C
C	type2	5000	C

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Satoshi Iijima

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 17/Mar/17 12:44

Updated:: 17/Mar/17 13:41