Affects Version/s: 1.17.0
Fix Version/s: None
Component/s: Functions - Drill
With a TSV file (demo.tsv.gz in attachment) generated on Windows (EOL = \r\n).
The file contains some special char like
The next request sometimes eat the first char of a line
The string "^/19/2015 9:33:39 AM" doesn't exists. Month is already present in this field in the TSV (so here there is "3/19/2015 9:33:39 AM" in the file demo.tsv).
If '\r\n' are replaced by '\n' with sed before the request, the result is correct as well with lineDelimiter => '\r\n' as lineDelimiter => '\n' or without function TABLE (there is no error and the date is correctly converted with to_timestamp function / columns d is correct in the result_pqt)
keeping '\r\n' and trying to move (in another line in demo.tsv) the line that produce error can prevent error (why ?)
keeping '\r\n' and trying to remove/modify one or more special char (like in "thá»\235i trang jean") can prevent error (why ?)
Didn't manage to reduce more the file demo.tsv while keeping the problem.