Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
Description
It would be simpler to assess the raw performance of parquet-cpp Parquet scans to have something akin to https://github.com/apache/parquet-cpp/blob/master/tools/parquet-scan.cc available as a callable Python function. This way we can isolate the performance of scanning the file from converting it to Arrow (and to pandas)
Attachments
Issue Links
- depends upon
-
PARQUET-1083 [C++] Refactor core logic in parquet-scan.cc so that it can be used as a library function for benchmarking
- Resolved