Details
Type: New Feature
Status: Open
Priority: Major
Resolution: Unresolved
Description
The datasets API currently allows users to plug in their own file formats, but the Substrait consumer only accepts Parquet and IPC. There is an open Substrait issue to make the file format extensible: https://github.com/substrait-io/substrait/issues/138
Once that feature is added, we will need an extension point in our Substrait consumer so that users can register their own file formats.
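As a rough illustration of what such an extension point could look like, here is a minimal sketch of a format registry. Everything in it is hypothetical: `FileFormat`, `FileFormatRegistry`, `default_registry`, and the string identifiers are stand-ins invented for this example and are not part of the datasets API or the Substrait consumer. How a plan would actually identify a custom format depends on the outcome of the Substrait issue linked above.

```python
from typing import Callable, Dict


class FileFormat:
    """Hypothetical stand-in for a dataset file format (not the real class)."""

    def __init__(self, name: str) -> None:
        self.name = name


class FileFormatRegistry:
    """Hypothetical extension point: maps a format identifier to a factory."""

    def __init__(self) -> None:
        self._factories: Dict[str, Callable[[], FileFormat]] = {}

    def register(self, format_id: str, factory: Callable[[], FileFormat]) -> None:
        # Refuse duplicates so a user format cannot silently replace a built-in.
        if format_id in self._factories:
            raise KeyError(f"file format already registered: {format_id}")
        self._factories[format_id] = factory

    def resolve(self, format_id: str) -> FileFormat:
        # The consumer would call this when a plan names a file format.
        try:
            factory = self._factories[format_id]
        except KeyError:
            raise KeyError(f"unsupported file format: {format_id}") from None
        return factory()


def default_registry() -> FileFormatRegistry:
    # Pre-populated with the two formats the consumer supports today.
    registry = FileFormatRegistry()
    registry.register("parquet", lambda: FileFormat("parquet"))
    registry.register("ipc", lambda: FileFormat("ipc"))
    return registry


# A user registers a custom format before consuming a plan that refers to it.
registry = default_registry()
registry.register("my_format", lambda: FileFormat("my_format"))
```

Registering factories rather than instances would let each plan get a fresh format object with its own options, though whether that is desirable is a design question for the real implementation.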