Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-2628

[Python] parquet.write_to_dataset is memory-hungry on large DataFrames

    XMLWordPrintableJSON

Details

    Description

      See discussion in https://github.com/apache/arrow/issues/1749. We should consider strategies for writing very large tables to a partitioned directory scheme.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              wesm Wes McKinney
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated: