Description
I think the first step it to design the interface that we want to present to users. Mostly this is defining options when importing. Off the top of my head:
- What is the separator?
- Provide column names or infer them from the first row.
- how to handle multiple files with possibly different schemas
- do we have a method to let users specify the datatypes of the columns or are they just strings?
- what types of quoting / escaping do we want to support?
Attachments
Issue Links
- is duplicated by
-
SPARK-2378 Implement functionality to read csv files
- Closed
- relates to
-
SPARK-2179 Public API for DataTypes and Schema
- Resolved
- links to