Details
Description
CSV files can be unpredictable. Among other quirks, it is possible for a single CSV record to span multiple lines in a file. In cases like these, TextFileSource is not effective and NLineFileSource is not flexible enough.
The result of this JIRA should be a CSVFileSource which, at minimum, should be able to deal with multiple-line CSV records.
Attachments
Attachments
Issue Links
- is related to
-
CRUNCH-429 The CSVFileSource does not always function properly
- Closed
-
CRUNCH-565 CSVInputFormat needs to be more defensive when configuring itself
- Closed
-
CRUNCH-414 The CSV file source needs to be a little more robust when handling multi-line CSV files
- Closed