Description
This is similar to SPARK-3365. Sample json is attached. Code to reproduce
var jsonDF = read.json("/tmp/t1.json") jsonDF.write.parquet("/tmp/t1.parquet")
The 'integration' object is empty in the json.
StackTrace:
.... Caused by: java.io.IOException: Could not read footer: java.lang.IllegalStateException: Cannot build an empty group at parquet.hadoop.ParquetFileReader.readAllFootersInParallel(ParquetFileReader.java:238) at org.apache.spark.sql.parquet.ParquetRelation2$MetadataCache.refresh(newParquet.scala:369) at org.apache.spark.sql.parquet.ParquetRelation2.org$apache$spark$sql$parquet$ParquetRelation2$$metadataCache$lzycompute(newParquet.scala:154) at org.apache.spark.sql.parquet.ParquetRelation2.org$apache$spark$sql$parquet$ParquetRelation2$$metadataCache(newParquet.scala:152) at org.apache.spark.sql.parquet.ParquetRelation2.refresh(newParquet.scala:197) at org.apache.spark.sql.sources.InsertIntoHadoopFsRelation.insert(commands.scala:134) ... 69 more Caused by: java.lang.IllegalStateException: Cannot build an empty group