Details
- Type: Epic
- Status: Resolved
- Priority: Major
- Resolution: Incomplete
- 2.2.0
- None
Description
This is an umbrella ticket tracking the general effort to improve performance and interoperability between PySpark and Pandas. The core idea is to use Apache Arrow as the serialization format to reduce the overhead of exchanging data between PySpark and Pandas.
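For illustration only, here is a minimal sketch of what Arrow-backed conversion looks like from the user side. It is not taken from this ticket. The helper names `arrow_session_conf` and `roundtrip_with_arrow` are hypothetical. The configuration key shown is the one used by Spark 3.x; Spark 2.3 and 2.4 used `spark.sql.execution.arrow.enabled` instead. The round trip assumes pyspark, pandas, and pyarrow are installed.

```python
# Sketch only: helper names are hypothetical, not part of PySpark.
# Config key is the Spark 3.x name; Spark 2.3/2.4 used
# "spark.sql.execution.arrow.enabled" instead.
ARROW_CONF_KEY = "spark.sql.execution.arrow.pyspark.enabled"


def arrow_session_conf(enabled=True):
    """Return the session setting that toggles Arrow-based conversion."""
    return {ARROW_CONF_KEY: "true" if enabled else "false"}


def roundtrip_with_arrow(pdf):
    """Convert a pandas DataFrame to Spark and back with Arrow enabled.

    Assumes pyspark, pandas, and pyarrow are installed.
    """
    # Imported lazily so the config helper works without Spark installed.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.master("local[1]").appName("arrow-sketch")
    for key, value in arrow_session_conf().items():
        builder = builder.config(key, value)
    spark = builder.getOrCreate()
    try:
        sdf = spark.createDataFrame(pdf)  # pandas -> Spark
        return sdf.toPandas()             # Spark -> pandas
    finally:
        spark.stop()
```

With the setting off, the same two calls still work but fall back to the non-Arrow conversion path, which is the overhead this effort aims to reduce.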
Issue Links
- incorporates: SPARK-21187 Complete support for remaining Spark data types in Arrow Converters (Resolved)