Description
Currently Spark requires users of external data sources such as spark-avro and spark-csv to specify the data source's full class name:
sqlContext.read.format("com.databricks.spark.avro").load(path)
Typing in a full class name is not ideal, so it would be nice to allow external packages to register themselves with Spark so that users can write:
sqlContext.read.format("avro").load(path)
This would let external data source packages follow the same convention as the built-in data sources (parquet, json, jdbc, etc.).
This could be accomplished by using a ServiceLoader.