Details
-
Umbrella
-
Status: Open
-
Major
-
Resolution: Unresolved
-
4.0.0
-
None
Description
verifySchema parameter of createDataFrame decides whether to verify data types of every row against schema.
In Spark Classic
Now it only takes effect for with createDataFrame with
- regular Python instances
We propose to make it work with createDataFrame with
- pyarrow.Table
- pandas.DataFrame with Arrow optimization
- pandas.DataFrame without Arrow optimization
In Spark Connect
Now it does not take effect.
We propose to make it work with all inputs.
Attachments
Issue Links
- links to
1.
|
Standardize verifySchema parameter of createDataFrame in Spark Classic | Open | Unassigned | |
2.
|
Deprecate "spark.sql.execution.pandas.convertToArrowArraySafely" configuration | Open | Unassigned | |
3.
|
Implement verifySchema parameter of createDataFrame in Spark Connect | Open | Unassigned |