Details
Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
None
Description
I was just trying this with the example found in the pyarrow docs at http://arrow.apache.org/docs/python/json.html
The documented example does not work. Is this related to this issue, or is it another matter?
It says to load the following JSON file:
{"a": [1, 2], "b": {"c": true, "d": "1991-02-03"}}
{"a": [3, 4, 5], "b": {"c": false, "d": "2019-04-01"}}
I changed this to make it a valid JSON document (the original is valid [JSON Lines|http://jsonlines.org/], but that's another issue):
[{"a": [1, 2], "b": {"c": true, "d": "1991-02-03"}},
{"a": [3, 4, 5], "b": {"c": false, "d": "2019-04-01"}}]
Then reading the JSON from a file called `my_data.json`:
from pyarrow import json
table = json.read_json("my_data.json")
Gives the following error:
---------------------------------------------------------------------------
ArrowInvalid                              Traceback (most recent call last)
<ipython-input-69-f974c21f0941> in <module>()
      1 from pyarrow import json
----> 2 table = json.read_json('test.json')

~/.local/share/virtualenvs/parquet-ifRxINoC/lib/python3.7/site-packages/pyarrow/_json.pyx in pyarrow._json.read_json()

~/.local/share/virtualenvs/parquet-ifRxINoC/lib/python3.7/site-packages/pyarrow/error.pxi in pyarrow.lib.check_status()

ArrowInvalid: JSON parse error: A column changed from object to array
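For anyone hitting the same error: `read_json` expects line-delimited JSON, so my guess is that the top-level array is what triggers the parse error, not the data itself. Below is a minimal sketch, using only the standard library, of how an array document could be turned back into JSON Lines before handing it to pyarrow. The helper name `json_array_to_lines` is made up for this illustration and is not part of pyarrow.

```python
import json


def json_array_to_lines(text):
    """Convert a JSON document holding a top-level array into JSON Lines.

    Hypothetical helper for illustration only; not a pyarrow function.
    """
    records = json.loads(text)
    if not isinstance(records, list):
        raise ValueError("expected a top-level JSON array")
    # One compact JSON value per line, each line terminated by a newline.
    return "\n".join(json.dumps(record) for record in records) + "\n"


# The array form of the documented example.
array_doc = """[{"a": [1, 2], "b": {"c": true, "d": "1991-02-03"}},
{"a": [3, 4, 5], "b": {"c": false, "d": "2019-04-01"}}]"""

lines_doc = json_array_to_lines(array_doc)
print(lines_doc, end="")

# Assumed follow-up (not run here): write lines_doc to a file and read it with
#   from pyarrow import json as pa_json
#   table = pa_json.read_json("my_data.json")
```

Note that `from pyarrow import json` shadows the standard library `json` module, which is why the comment above imports it under the alias `pa_json`.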