Details
-
Sub-task
-
Status: Open
-
Major
-
Resolution: Unresolved
-
3.1.0
-
None
-
None
Description
Datasource V2 does not currently support bucketed reads or writes similar to Datasource V1 does. See DatasourceScanExec and config
spark.sql.sources.bucketing.enabled. We need to add support to V2 as well.
Support writing file data source with bucketing looks like:
fileDf.write.bucketBy(...).sortBy(..)...