Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
PyArrow's dev page has a build-and-test section that currently does not list DATASET as a component. Using a recent Arrow version (commit e5e490), I observed DATASET was mandatory for the successful completion of the test suite ran by `python -m pytest pyarrow/`, as recommended on the page. Without `export PYARROW_WITH_DATASET=1`, I observed errors with `test_dataset.py`, `test_exec_plan.py`, and a couple others.
Since DATASET is intended to be an optional component, it should be listed on this section. In addition, the documented test suite command should be updated to one that doesn't fail without the DATASET component being selected (or else the test suite itself should be fixed).
Attachments
Issue Links
- is related to
-
ARROW-16526 [Python] test_partitioned_dataset fails when building with PARQUET but without DATASET
- Resolved
- links to