Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
None
Description
A table with skip.header.line.count=1 does not skip the first line if it is blank, except in a fetch task.
To reproduce, create a CSV table and set skip.header.line.count=1 in the table properties.
In the table location, create a single file with a blank (empty) first line and, say, 2 further lines.
If you do a select * on it, you see 2 rows (correct).
If you do a select count(*) on it, you get 3 (incorrect).
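The input file described above can be generated outside Hive, and the row count a correct reader should return can be checked against it. The following is only a minimal sketch of the expected semantics (skip the first N physical lines, blank or not). The helper names `write_test_file` and `expected_rows` and the file name `data.csv` are hypothetical and are not part of Hive.

```python
import os
import tempfile


def write_test_file(directory, data_lines):
    """Write a file whose first line is blank, followed by data_lines."""
    path = os.path.join(directory, "data.csv")  # hypothetical file name
    with open(path, "w", newline="") as f:
        f.write("\n")  # the blank (empty) header line
        for line in data_lines:
            f.write(line + "\n")
    return path


def expected_rows(path, skip_header_line_count=1):
    """Return the lines left after skipping the first N physical lines.

    This models the intended behaviour only: a blank line still counts
    as a header line and is skipped like any other.
    """
    with open(path, newline="") as f:
        lines = f.read().splitlines()
    return lines[skip_header_line_count:]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as d:
        path = write_test_file(d, ["1,2019-12-31", "2,2019-12-31"])
        rows = expected_rows(path)
        # Both select * and select count(*) should agree with this count.
        print(len(rows))
```

Copying the generated file into the table location gives the single-file setup from the reproduction steps. A count of 3 from Hive would then mean the blank header line was counted as a row.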
CREATE EXTERNAL TABLE `testcase1`(id int, name string)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
LOCATION '${system:test.tmp.dir}/testcase1'
TBLPROPERTIES ("skip.header.line.count"="1");

SET hive.fetch.task.conversion = more;
select * from testcase1;
select count(*) from testcase1;

set hive.fetch.task.conversion=none;
select * from testcase1;
select count(*) from testcase1;

Test file:
1,2019-12-31
2,2019-12-31
3,2019-12-31

Should both yield (with the above test file):
#### A masked pattern was here ####
1 2019-12-31
2 2019-12-31
3 2019-12-31
3
Issue Links
- relates to
  - HIVE-19943 Header values keep showing up in result sets (Open)
  - HIVE-22769 Incorrect query results and query failure during split generation for compressed text files (Closed)
- links to