Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
0.7.0
-
None
-
None
-
None
Description
Comma Separated Values (CSV) text format are commonly used in exchanging relational data between heterogeneous systems. Currently Hive uses TextFile format when displaying query results. This could cause confusions when column values contain new lines or tabs. A CSVTextFile format could get around this problem. This will require a new CSVTextInputFormat, CSVTextOutputFormat, and CSVSerDe.
A proposed use case is like:
-- exporting a table to CSV files in a directory hive> set hive.io.output.fileformat=CSVTextFile; hive> insert overwrite local directory '/tmp/CSVrepos/' select * from S where ... ; -- query result in CSV hive -e 'set hive.io.output.fileformat=CSVTextFile; select * from T;' | sql_loader_to_other_systems -- query CSV files directory from Hive hive> create table T (...) stored as CSVTextFile; hive> load data local inpath '/my/CSVfiles' into table T; hive> select * from T where ...;
Attachments
Issue Links
- duplicates
-
HIVE-7777 Add CSV Serde based on OpenCSV
- Closed