Details
- Bug
- Status: Closed
- Critical
- Resolution: Duplicate
- 2.0.0
- None
Description
- I'm loading a JSON Lines file into a table using `pa.json.read_json`. It contains one column that is a nested dictionary.
- I select a row by key and inspect its nested dictionary.
- I write the table to Parquet.
- I load the table again from the Parquet file.
- I check the same key, and the nested dictionary is not the same as before.
To reproduce:
Find the attached JSONLines file and Jupyter Notebook.
The JSON file contains one entry per customer with a generated `msisdn`, a `scoring_request_id` and a `scorecard_result` object. Each `scorecard_result` consists of a list of feature objects, all with the same value as the `msisdn`, and a score.
The notebook reads the file and demonstrates the issue.
Attachments
Issue Links
- is duplicated by ARROW-10493 [C++][Parquet] Writing nullable nested strings results in wrong data in file (Resolved)