Description
I am investigating right now.
The issue is specific to Parquet:
set hive.cbo.enable=true; drop table staging; drop table parquet_ctas; create table staging (key int, value string) stored as textfile; insert into table staging select * from src order by key limit 10; select * from staging; create table parquet_ctas stored as parquet as select * from staging; select * from parquet_ctas; create table orc_ctas stored as orc as select * from staging; select * from orc_ctas; create table txt_ctas stored as textfile as select * from staging; select * from txt_ctas;
The parquet query returns all NULLs with CBO on.