Details
- New Feature
- Status: Resolved
- Major
- Resolution: Fixed
- None
Description
At the moment, pyarrow.Array instances have a method called to_pandas. Although this method returns NumPy arrays, it returns them in the form that pandas would use in a Series. The difference is visible, for example, for integers with null values. For pandas, we convert the data into a float array and set every entry that is null in the Arrow array to NaN. For vanilla NumPy arrays, we would instead return a tuple of a validity bytemap (one byte per element, not a bitmap!) and a values array. The values array in this case should simply be a view on the underlying Arrow buffer.
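To make the two representations concrete, here is a minimal NumPy-only sketch. It does not use pyarrow and is not the proposed implementation. The raw buffers stand in for the values buffer and validity bitmap of an Arrow int64 array `[1, null, 3]`, and the helper names `to_numpy_pair` and `to_pandas_style` are hypothetical, chosen only for illustration.

```python
import struct

import numpy as np

# Stand-ins for the two buffers of an Arrow int64 array [1, null, 3].
# The slot behind a null may hold any value; 0 is used here.
values_buf = struct.pack("<3q", 1, 0, 3)
# Arrow validity bitmaps are LSB-first: bits 0 and 2 set means
# elements 0 and 2 are valid.
bitmap_buf = bytes([0b00000101])


def to_numpy_pair(values_buf, bitmap_buf, length):
    """Hypothetical helper: return (validity bytemap, values view)."""
    # Reinterpret the values buffer in place, without copying it.
    values = np.frombuffer(values_buf, dtype="<i8", count=length)
    # Expand the bitmap to one boolean byte per element.
    bits = np.unpackbits(
        np.frombuffer(bitmap_buf, dtype=np.uint8), bitorder="little"
    )
    valid = bits[:length].astype(bool)
    return valid, values


def to_pandas_style(values_buf, bitmap_buf, length):
    """Hypothetical helper: float array with NaN in place of nulls."""
    valid, values = to_numpy_pair(values_buf, bitmap_buf, length)
    floats = values.astype(np.float64)  # this conversion copies the data
    floats[~valid] = np.nan
    return floats


valid, values = to_numpy_pair(values_buf, bitmap_buf, 3)
floats = to_pandas_style(values_buf, bitmap_buf, 3)
```

In this sketch the pair keeps the integer dtype and leaves the values buffer uncopied, while the pandas-style result has to allocate a new float64 array to make room for NaN.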
Issue Links
- is duplicated by
  - ARROW-2295 [Python] Add Array.to_numpy functions (Closed)
- is related to
  - ARROW-2853 [Python] Implementing support for zero copy NumPy arrays in libarrow_python (Closed)
- links to