[SPARK-15463] Support for creating a dataframe from CSV in Dataset[String] - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.0.0
Fix Version/s: 2.2.0
Component/s: SQL
Labels:
None

Description

I currently use Databrick's spark-csv lib but some features don't work with Apache Spark 2.0.0-SNAPSHOT. I understand that with the addition of CSV support into spark-sql directly, that spark-csv won't be modified.
I currently read some CSV data that has been pre-processed and is in RDD[String] format.
There is sqlContext.read.json(rdd: RDD[String]) but other formats don't appear to support the creation of DataFrames based on loading from RDD[String].

Attachments

Issue Links

is related to

SPARK-15615 Support for creating a dataframe from JSON in Dataset[String]

Resolved

SPARK-22505 toDF() / createDataFrame() type inference doesn't work as expected

Resolved

links to

[Github] Pull Request #13300 (xwu0226)

[Github] Pull Request #16854 (HyukjinKwon)

Activity

People

Assignee:: Hyukjin Kwon

Reporter:: PJ Fanning

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Dates

Created:: 21/May/16 13:41

Updated:: 12/Dec/22 18:11

Resolved:: 08/Mar/17 21:43