Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Duplicate
-
1.0.1, 2.0.0
-
None
-
None
Description
I'm sorry if this was already reported, but there's an overflow issue in concatenation of large arrays
In [1]: import pyarrow as pa In [2]: str_array = pa.array(['a' * 128] * 10**8) In [3]: large_array = pa.concat_arrays([str_array] * 50) Segmentation fault (core dumped)
I suppose that this should be handled by upcast to large_string.
Attachments
Issue Links
- is related to
-
ARROW-10799 [C++] Take on string chunked arrays slow and fails
- Closed
- is superceded by
-
ARROW-17828 [C++][Python] Large strings cause ArrowInvalid: offset overflow while concatenating arrays
- Open
- relates to
-
ARROW-10494 .take silently overflow on list array (when casting to large_list is needed)
- Open
-
ARROW-7731 [C++][Parquet] Support LargeListArray
- Closed