Details
Type: Bug
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 1.12.1, 1.12.2
Fix Version/s: None
Component/s: None
Description
This ticket relates to PARQUET-2027. In that ticket, merging two Parquet files produced by 1.11.x failed in 1.12.0. In 1.12.1 the merge was fixed, i.e. it no longer fails, but it now produces a corrupted output file. The error:
Dictionary page must be before data page.
is thrown when one tries to read the merged file. It originates at https://github.com/apache/parquet-cpp/blob/master/src/parquet/arrow/record_reader.cc#L712.
I attached two example input files and the outcome of merging.
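To make the error message easier to interpret, here is a minimal sketch of the kind of ordering check that would produce it. It is a simplified model and not the parquet-cpp code: the function name `check_page_order`, the tuple representation of pages, and the example chunks are all hypothetical. It assumes the reader rejects a dictionary-encoded data page when no dictionary page has been seen earlier in the same column chunk. It does not claim that the attached merged file has exactly this layout.

```python
# Simplified, hypothetical model of the pages in one column chunk.
# This is NOT the parquet-cpp implementation; all names are illustrative.
DICTIONARY_ENCODINGS = {"PLAIN_DICTIONARY", "RLE_DICTIONARY"}


def check_page_order(pages):
    """Raise ValueError if a dictionary-encoded data page has no dictionary page before it.

    `pages` is a list of (page_type, encoding) tuples in file order.
    """
    dictionary_seen = False
    for page_type, encoding in pages:
        if page_type == "DICTIONARY_PAGE":
            dictionary_seen = True
        elif page_type == "DATA_PAGE" and encoding in DICTIONARY_ENCODINGS:
            if not dictionary_seen:
                raise ValueError("Dictionary page must be before data page.")


# Well-formed chunk: the dictionary page precedes the data page that uses it.
good_chunk = [("DICTIONARY_PAGE", "PLAIN"), ("DATA_PAGE", "RLE_DICTIONARY")]

# Assumed faulty chunk: the data page comes before its dictionary page.
bad_chunk = [("DATA_PAGE", "RLE_DICTIONARY"), ("DICTIONARY_PAGE", "PLAIN")]
```

Calling `check_page_order(good_chunk)` returns without error, while `check_page_order(bad_chunk)` raises a `ValueError` carrying the same message as the one reported above.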