Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-16119 [Python] Deprecate the legacy ParquetDataset custom python-based implementation
  3. ARROW-16122

[Python] Change use_legacy_dataset default and deprecate no-longer supported keywords in parquet.write_to_dataset

    XMLWordPrintableJSON

Details

    Description

      Currently, the pq.write_to_dataset function also had a use_legacy_dataset keyword, but we should:

      1) in case of use_legacy_dataset=True, ensure we raise deprecation warnings for all keywords that won't be supported in the new implementation (eg partition_filename_cb)
      2) raise a deprecation warning for use_legacy_dataset=True, and/or already switch the default?

      Attachments

        Issue Links

          Activity

            People

              alenka Alenka Frim
              jorisvandenbossche Joris Van den Bossche
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 5h
                  5h