We have many nested parquet files generated from Apache Spark for ranking problems, and we would like to load them in python for other programs to consume.
The schema looks like
And when I tried to load it with nightly build pyarrow on Oct 4, 2017, I got the following error.
I somehow get the impression that after https://issues.apache.org/jira/browse/PARQUET-911 is merged, we should be able to load the nested parquet in pyarrow.
Any insight about this?