Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-3764 [C++] Port Python "ParquetDataset" business logic to C++
  3. ARROW-8039

[Python][Dataset] Support using dataset API in pyarrow.parquet with a minimal ParquetDataset shim

    XMLWordPrintableJSON

Details

    Description

      Assemble a minimal ParquetDataset shim backed by pyarrow.dataset.*. Replace the existing ParquetDataset with the shim by default, allow opt-out for users who need the current ParquetDataset

      This is mostly exploratory to see which of the python tests fail

      Attachments

        Issue Links

          Activity

            People

              jorisvandenbossche Joris Van den Bossche
              bkietz Ben Kietzman
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 8h 40m
                  8h 40m