Description
We are using tblproperties ("skip.header.line.count"="1") when creating an external table.
When we run select * from table, the result set comes back as expected, without the header row.
However, when we run, for instance, count(1), the header is included in the count (verified by running select * from table and pasting the result into Notepad to count the rows).
Likewise, when we run select distinct(column) from table, the header is returned as one of the distinct values.
file structure:
_TESTING_TYPE |
---|
adf |
hyg |
abc |
Update: 26/06/2018
Create statement:
-----------------------------------
--test_type--
-----------------------------------
CREATE EXTERNAL TABLE IF NOT EXISTS ext.test_type_in (
  test_type string
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\073'
STORED AS TEXTFILE
LOCATION 'adl://{adlslocation}data/data2/test'
tblproperties ("skip.header.line.count"="1")
Select statement:
select * from test_type_in;
Distinct statement:
select distinct test_type from test_type_in ORDER BY test_type;
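Count statement (a sketch only: the exact count query is not part of this report, so the form below is an assumption based on the count(1) mentioned in the description; the numbers in the comments assume the three-row sample file shown under "file structure"):

```sql
-- Sketch only: reconstructed from the description, not the original query.
-- Assumes the table from the create statement above and a file holding
-- one header line followed by the three rows adf, hyg, abc.
select count(1) from test_type_in;
-- expected with "skip.header.line.count"="1": 3
-- behaviour described in this report:        4 (header line is counted)
```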
I cannot show the exact statements because of an NDA, so I changed those values to test.
I can also tell you that this happens not just on our HDInsight cluster but also at another company we work for. It does not matter what is in the data either. So, for testing purposes:
test_type,abcg,gjeiza,aze,grriajj,gd,rrjri,vdju
Issue Links
- is related to HIVE-25505 Incorrect results with header. skip.header.line.count if first line is blank (Closed)