Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 5.0.0
Fix Version/s: None
Description
Reading a JSON Lines file with pyarrow.json.read_json can segfault when a field that appears in earlier lines is missing from a later line.
This happens in particular when the missing field is a list and the block size is small enough.
Here is an example to reproduce:
import io
import pyarrow.json as paj

batch = b'{"a": [], "b": 1}\n{"b": 1}'
block_size = 12

paj.read_json(
    io.BytesIO(batch),
    read_options=paj.ReadOptions(block_size=block_size)
)
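Since a segfault takes down the whole interpreter, it may be easier to run the reproduction in a child process and inspect the exit code. The following is a minimal sketch of that idea, not part of the original report: the names `REPRO`, `run_isolated`, and `describe` are hypothetical helpers, and it assumes pyarrow is installed in the same environment as the parent interpreter.

```python
import subprocess
import sys
import textwrap

# The reproduction from this report, kept as source text so that it can be
# executed in a separate interpreter (hypothetical helper constant).
REPRO = textwrap.dedent(r'''
    import io
    import pyarrow.json as paj

    batch = b'{"a": [], "b": 1}\n{"b": 1}'
    paj.read_json(
        io.BytesIO(batch),
        read_options=paj.ReadOptions(block_size=12),
    )
''')


def run_isolated(source: str) -> int:
    """Run Python source in a child interpreter and return its exit code."""
    result = subprocess.run([sys.executable, "-c", source], capture_output=True)
    return result.returncode


def describe(returncode: int) -> str:
    """Turn an exit code into a short human-readable summary."""
    if returncode == 0:
        return "completed normally"
    if returncode < 0:
        # On POSIX, subprocess reports death by signal N as -N.
        return f"killed by signal {-returncode}"
    return f"exited with code {returncode}"


if __name__ == "__main__":
    print(describe(run_isolated(REPRO)))
```

On POSIX systems a crash from a segmentation fault would normally show up as a negative return code; on Windows the child would instead exit with a nonzero positive code. A missing pyarrow installation also produces a nonzero exit code, so the summary alone does not prove the bug is present.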
Issue Links
- is superseded by ARROW-13871 [C++] JSON reader can fail if a list array key is present in one chunk but not in a later chunk (Resolved)