Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
The following classes should be accessible from Python:
- class DataSource
- class DataSourceDiscovery
- class Dataset
- class ScanContext, ScanOptions, ScanTask
- class ScannerBuilder
- class Scanner
The end result is reading a directory of parquet files as a single stream. One should be able to re-implement https://github.com/apache/arrow/pull/5720 in python.
Attachments
Issue Links
- depends upon
-
ARROW-6242 [C++] Implements basic Dataset/Scanner/ScannerBuilder
- Resolved
- links to