Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
0.14.1
-
python(3.7.3), pyarrow(0.14.1), arrow-cpp(0.14.1), parquet-cpp(1.5.1), Arch Linux x86_64
Description
I'm not sure if this should be reported to Parquet or here.
When I tried to serialize a pyarrow table with a fixed size binary field (holds 16 byte UUID4 information) to a parquet file, segmentation fault occurs.
Here is the minimal example to reproduce:
import pyarrow as pa
from pyarrow import parquet as pq
data = {"col": pa.array([b"1234" for _ in range(10)])}
fields = [("col", pa.binary(4))]
schema = pa.schema(fields)
table = pa.table(data, schema)
pq.write_table(table, "test.parquet")
segmentation fault (core dumped) ipython
Yet, it works if I don't specify the size of the binary field.
import pyarrow as pa
from pyarrow import parquet as pq
data = {"col": pa.array([b"1234" for _ in range(10)])}
fields = [("col", pa.binary())]
schema = pa.schema(fields)
table = pa.table(data, schema)
pq.write_table(table, "test.parquet")
Thanks,
Attachments
Issue Links
- links to