Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-7226

[JSON][Python] Json loader fails on example in documentation.

    XMLWordPrintableJSON

Details

    Description

      I was just trying this with the example found in the pyarrow docs at http://arrow.apache.org/docs/python/json.html

      The documented example does not work. Is this related to this issue, or is it another matter?

      It says to load the following JSON file:

      {"a": [1, 2], "b": {"c": true, "d": "1991-02-03"}}
      {"a": [3, 4, 5], "b": {"c": false, "d": "2019-04-01"}}

      I fixed this to make it valid JSON (It is valid [JSON Lines|http://jsonlines.org/], but that's another issue):

      [{"a": [1, 2], "b": {"c": true, "d": "1991-02-03",}}
      {"a": [3, 4, 5], "b": {"c": false, "d": "2019-04-01"]}}

      Then reading the JSON from a file called `my_data.json`:

      from pyarrow import json
      table = json.read_json("my_data.json")

      Gives the following error:

      ---------------------------------------------------------------------------}}
       ArrowInvalid Traceback (most recent call last)
       <ipython-input-69-f974c21f0941> in <module>()
       1 from pyarrow import json
       ----> 2 table = json.read_json('test.json')
      ~/.local/share/virtualenvs/parquet-ifRxINoC/lib/python3.7/site-packages/pyarrow/_json.pyx in pyarrow._json.read_json()
      ~/.local/share/virtualenvs/parquet-ifRxINoC/lib/python3.7/site-packages/pyarrow/error.pxi in pyarrow.lib.check_status()
      ArrowInvalid: JSON parse error: A column changed from object to array
      

      Attachments

        Issue Links

          Activity

            People

              arw2019 Andrew Wieteska
              RinkeHoekstra Rinke Hoekstra
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1.5h
                  1.5h