ARROW-16582: [Python] Include DATASET in list of components in PyArrow's dev page

Details

    Description

      PyArrow's dev page has a build-and-test section that currently does not list DATASET as a component. Using a recent Arrow version (commit e5e490), I observed that DATASET was required for the test suite run by `python -m pytest pyarrow/`, the command recommended on the page, to complete successfully. Without `export PYARROW_WITH_DATASET=1`, I observed errors in `test_dataset.py`, `test_exec_plan.py`, and a couple of others.

      Since DATASET is intended to be an optional component, it should be listed in this section. In addition, the documented test suite command should be replaced with one that does not fail when the DATASET component is not selected (or else the test suite itself should be fixed).
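
      To illustrate the "fix the test suite" option, here is a minimal sketch of guarding tests behind an optional component. It is not the mechanism PyArrow's own test suite uses. The helper `component_available` and the test class `DatasetSmokeTest` are hypothetical names, and the sketch assumes that a build without DATASET raises `ImportError` on `import pyarrow.dataset`.

```python
import importlib
import unittest


def component_available(module_name: str) -> bool:
    """Return True if the named module imports cleanly, False otherwise."""
    try:
        importlib.import_module(module_name)
    except ImportError:
        # Covers ModuleNotFoundError too, since it subclasses ImportError.
        return False
    return True


# Assumption: a PyArrow build without the DATASET component (or no PyArrow
# at all) fails this import with ImportError.
HAS_DATASET = component_available("pyarrow.dataset")


@unittest.skipUnless(HAS_DATASET, "pyarrow was built without DATASET")
class DatasetSmokeTest(unittest.TestCase):  # hypothetical test case
    def test_dataset_factory_exists(self):
        import pyarrow.dataset as ds

        self.assertTrue(hasattr(ds, "dataset"))


if __name__ == "__main__":
    unittest.main()
```

      With a guard like this, tests that need the component are reported as skipped rather than as errors when the build omits it.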

            People

              raulcd Raúl Cumplido
              rtpsw Yaron Gvili
              Votes: 0
              Watchers: 5

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 50m