Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-86

Python: Implement zero-copy Arrow-to-Pandas conversion

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.1.0
    • Python
    • None

    Description

      For this to work, we need:

      • a check if the Arrow Array could be represented as Pandas Series using the same memory buffer
      • a numpy.ndarray subclass that keeps a reference to the Arrow Array

      With the NumPy C-API it is already possible to create a numpy.ndarray instance that uses the Arrow Array memory buffer internally but there is no way to keep track if the numpy.ndarray still lives or the underlying memory can be free'd.

      Attachments

        Activity

          People

            uwe Uwe Korn
            uwe Uwe Korn
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment