Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
4.0.0
Description
Support registering Python data sources.
Users can register a Python data source and later use reference it using its name.
class MyDataSource(DataSource): @classmethod def name(cls): return "my-data-source" spark.dataSource.register(MyDataSource)
Users can then use the name of the data source as the format (will be supported in SPARK-45639)
spark.read.format("my-data-source").load()