Details
-
New Feature
-
Status: Open
-
Major
-
Resolution: Unresolved
-
4.0.1
-
None
Description
Arrow Datasets currently support Parquet but not ORC. ORC support needs to be added.
Attachments
Issue Links
- is related to
-
ARROW-13572 [C++][Python] Add basic ORC support to the pyarrow.datasets API
- Resolved
1.
|
[C++][Python] Add basic ORC support to the pyarrow.datasets API | Resolved | Joris Van den Bossche |
|
||||||||
2.
|
[C++] Add async version of the ORC Dataset scanner | Open | Unassigned | |||||||||
3.
|
[C++] Add write support for ORC in the Datasets API | Open | Unassigned | |||||||||
4.
|
[C++] Implement column projection pushdown to ORC reader in Datasets API | Resolved | Joris Van den Bossche |
|
||||||||
5.
|
[C++] Add support for batch_size in the ORC Scanner (Dataset) | Resolved | zhixingheyi-tian |
|
||||||||
6.
|
[C++][Dataset] Support Count function without projections in ORC to avoid loading all columns | Open | Unassigned | |||||||||
7.
|
[C++][Dataset] Add support for filter pushdown in the ORC Scanner | Open | Unassigned |