Details
Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Description
All of the SELECT statements below produce results, except for the last query, which returns nothing.
Judging by the MM logs, the inputs are read correctly, so this is probably something Parquet-specific related to a table having multiple files.
set hive.optimize.index.filter=true;
set hive.auto.convert.join=false;

CREATE TABLE tbl1(id INT) STORED AS PARQUET;
INSERT INTO tbl1 VALUES(1), (2);

CREATE TABLE tbl2(id INT, value STRING) STORED AS PARQUET;
INSERT INTO tbl2 VALUES(1, 'value1');
INSERT INTO tbl2 VALUES(1, 'value2');

select tbl1.id, t1.value
FROM tbl1
JOIN (SELECT * FROM tbl2 WHERE value='value2') t1 ON tbl1.id=t1.id;

select tbl1.id, t1.value
FROM tbl1
JOIN (SELECT * FROM tbl2 WHERE value='value1') t1 ON tbl1.id=t1.id;

select tbl1.id, t1.value, t2.value
FROM tbl1
JOIN tbl2 t1 ON tbl1.id=t1.id
JOIN tbl2 t2 ON tbl1.id=t2.id;

select tbl1.id, t1.value, t2.value
FROM tbl1
JOIN (SELECT * FROM tbl2 WHERE value='value1') t1 ON tbl1.id=t1.id
JOIN (SELECT * FROM tbl2 WHERE value='value2') t2 ON tbl1.id=t2.id;
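
For reference, the row the last query should return can be cross-checked outside Hive. The sketch below is not part of the original report: it loads the same data into an in-memory SQLite database (an assumption made purely for illustration, with TEXT standing in for STRING and no Parquet storage involved) and runs the last query unchanged. It only illustrates the expected join result and says nothing about the cause of the bug.

```python
import sqlite3

# In-memory SQLite database used as a stand-in reference engine (not Hive).
conn = sqlite3.connect(":memory:")

# Same data as in the report; STORED AS PARQUET is dropped because it is Hive-specific.
conn.executescript("""
CREATE TABLE tbl1(id INT);
INSERT INTO tbl1 VALUES (1), (2);
CREATE TABLE tbl2(id INT, value TEXT);
INSERT INTO tbl2 VALUES (1, 'value1');
INSERT INTO tbl2 VALUES (1, 'value2');
""")

# The last query from the report, unchanged apart from line breaks.
last_query = """
select tbl1.id, t1.value, t2.value
FROM tbl1
JOIN (SELECT * FROM tbl2 WHERE value='value1') t1 ON tbl1.id=t1.id
JOIN (SELECT * FROM tbl2 WHERE value='value2') t2 ON tbl1.id=t2.id
"""

rows = conn.execute(last_query).fetchall()
print(rows)  # prints [(1, 'value1', 'value2')]
```

Only id 1 has matching rows in tbl2, one per filtered subquery, so a single row (1, value1, value2) is the expected output of the last query.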