Apache Arrow / ARROW-369

[Python] Add ability to convert multiple record batches at once to pandas

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.2.0
    • Component/s: Python

    Description

      Instead of only being able to convert single record batches and tables that consist only of single ColumnChunks, we should also support the construction of pandas DataFrames from multiple RecordBatches. In the simplest approach, we would convert each batch to a pandas DataFrame and then concatenate them all. A second (and preferred) implementation would extend the C++ function ConvertColumnToPandas in python/src/pyarrow/adapters/pandas.* to work on chunked columns.

            People

              bryanc Bryan Cutler
              wesm Wes McKinney
              Votes: 0
              Watchers: 4

