Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-2040

[Python] Deserialized Numpy array must keep ref to underlying tensor

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8.0
    • Fix Version/s: 0.9.0
    • Component/s: None

      Description

      pyarrow.deserialize works fine, however.

      Python 2.7.12 (default, Nov 20 2017, 18:23:56)
      [GCC 5.4.0 20160609] on linux2
      Type "help", "copyright", "credits" or "license" for more information.
      >>> import pyarrow as pa, numpy as np
      >>> with open('test.pyarrow', 'w') as f:
      ...     f.write(pa.serialize(np.arange(10, dtype=np.int32)).to_buffer().to_pybytes())
      ...
      >>> pa.read_serialized(pa.OSFile('test.pyarrow')).deserialize()
      array([54846320, 0, 45484448, 0, 4, 5, 6, 7, 8, 9], dtype=int32)
      >>> pa.deserialize(pa.frombuffer(open('test.pyarrow').read()))
      array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9], dtype=int32)
      

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                pitrou Antoine Pitrou
                Reporter:
                rshin Richard Shin
              • Votes:
                0 Vote for this issue
                Watchers:
                7 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: