[PHOENIX-5258] Add support to parse header from the input CSV file as input columns for CsvBulkLoadTool - ASF JIRA

Details

Type: Improvement
Status: Patch Available
Priority: Minor
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: 5.3.0
Component/s: None
Labels: None

Description

Currently, CsvBulkLoadTool cannot read a header from the input CSV file; it expects the columns in the CSV to match the table schema. Header support can be added so that the input columns are mapped to the schema dynamically from the header.

The proposed solution is to introduce a new option for the tool, `--parse-header`. If this option is passed, the list of input columns is constructed by reading the first line of the input CSV file.

- If there is only one file, read the header from its first line and generate the `ColumnInfo` list.
- If there are multiple files, read the header from each file and throw an error if the headers do not match across files.
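The two cases above can be sketched roughly as follows. This is only an illustration and is not taken from the attached patches: the class and method names (`HeaderCheckSketch`, `readHeader`, `commonHeader`) are hypothetical, file contents are passed as in-memory strings instead of being read from HDFS, the header is split on a plain delimiter with no quote handling, and headers are assumed to match only when names and order are identical. Building the `ColumnInfo` list from the returned names is left out.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of Phoenix or the attached patches.
class HeaderCheckSketch {

    // Reads the first line and splits it on the delimiter.
    // Assumption: header names contain no quoted or escaped delimiters.
    static List<String> readHeader(BufferedReader reader, char delimiter) throws IOException {
        String firstLine = reader.readLine();
        if (firstLine == null) {
            throw new IllegalArgumentException("Empty input: no header line found");
        }
        List<String> columns = new ArrayList<>();
        String separator = Pattern.quote(String.valueOf(delimiter));
        for (String name : firstLine.split(separator, -1)) {
            columns.add(name.trim());
        }
        return columns;
    }

    // Returns the header shared by all inputs; throws if any input
    // has a header that differs from the first one.
    static List<String> commonHeader(List<String> fileContents, char delimiter) throws IOException {
        if (fileContents.isEmpty()) {
            throw new IllegalArgumentException("No input files given");
        }
        List<String> expected = null;
        for (int i = 0; i < fileContents.size(); i++) {
            List<String> header;
            try (BufferedReader reader = new BufferedReader(new StringReader(fileContents.get(i)))) {
                header = readHeader(reader, delimiter);
            }
            if (expected == null) {
                expected = header; // a single file simply defines the column list
            } else if (!expected.equals(header)) {
                throw new IllegalArgumentException(
                    "Header mismatch in input " + i + ": expected " + expected + " but found " + header);
            }
        }
        return expected;
    }

    public static void main(String[] args) throws IOException {
        List<String> files = Arrays.asList("ID,NAME\n1,alice\n", "ID,NAME\n2,bob\n");
        System.out.println(commonHeader(files, ','));
    }
}
```

Failing fast on the first mismatching file keeps the error message specific. Whether the comparison should be case-sensitive or order-sensitive is a design choice the description does not settle.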

Attachments

- PHOENIX-5258-master.patch (06/May/19 18:33, 19 kB, Prashant Vithani)
- PHOENIX-5258-master.001.patch (14/May/19 08:29, 26 kB, Prashant Vithani)
- PHOENIX-5258-4.x-HBase-1.4.patch (06/May/19 18:23, 19 kB, Prashant Vithani)
- PHOENIX-5258-4.x-HBase-1.4.001.patch (14/May/19 08:35, 26 kB, Prashant Vithani)

Issue Links

links to

GitHub Pull Request #498

People

Assignee: Prashant Vithani

Reporter: Prashant Vithani

Votes: 0

Watchers: 5

Dates

Created: 23/Apr/19 13:09

Updated: 23/Jun/22 20:04

Time Tracking

Estimated: Not Specified

Remaining:

Logged: 40m