[SPARK-40854] Change default serialization from 'broken' CSV to Spark DF JSON - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.4.0
Fix Version/s: 3.4.0
Component/s: Connect
Labels:
None

Description

Currently, Spark Connect uses a mediocre version of CSV serialization for the results from the server side without proper batching.

We should at least migrate to the existing Spark DF JSON serialization and batch the results.

Attachments

Issue Links

links to

[Github] Pull Request #38300 (grundprinzip)

[Github] Pull Request #38300 (grundprinzip)

Activity

People

Assignee:: Martin Grund

Reporter:: Martin Grund

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 20/Oct/22 08:12

Updated:: 12/Dec/22 18:10

Resolved:: 21/Oct/22 11:49