[SPARK-14726] Support for sampling when inferring schema in CSV data source - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Won't Fix
Affects Version/s: 2.0.0
Fix Version/s: None
Component/s: SQL
Labels:
None

Description

Currently, I am using CSV data source and trying to get used to Spark 2.0 because it has built-in CSV data source.

I realized that CSV data source infers schema with all the data. JSON data source supports sampling ratio option.

It would be great if CSV data source has this option too (or is this supported already?).

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Bomi Kim

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 19/Apr/16 08:32

Updated:: 12/Dec/22 18:11

Resolved:: 04/Apr/17 01:21