Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-86

Python: Implement zero-copy Arrow-to-Pandas conversion

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.1.0
    • Component/s: Python
    • Labels:
      None

      Description

      For this to work, we need:

      • a check if the Arrow Array could be represented as Pandas Series using the same memory buffer
      • a numpy.ndarray subclass that keeps a reference to the Arrow Array

      With the NumPy C-API it is already possible to create a numpy.ndarray instance that uses the Arrow Array memory buffer internally but there is no way to keep track if the numpy.ndarray still lives or the underlying memory can be free'd.

        Attachments

          Activity

            People

            • Assignee:
              uwe Uwe Korn
              Reporter:
              uwe Uwe Korn
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: