Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
1.21.1
None
Description
Drill fails to read a parquet file with the following exception:
Caused by: org.apache.parquet.io.ParquetDecodingException: could not read bytes at offset 591804
at org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:42)
at org.apache.drill.exec.store.parquet.columnreaders.VarLenColumnBulkInput$ValuesReaderWrapper.getNextEntry(VarLenColumnBulkInput.java:754)
... 43 common frames omitted
Caused by: java.io.EOFException: null
at org.apache.parquet.bytes.SingleBufferInputStream.read(SingleBufferInputStream.java:52)
at org.apache.parquet.bytes.BytesUtils.readIntLittleEndian(BytesUtils.java:83)
at org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:39)
... 44 common frames omitted
This issue affects only queries run with store.parquet.flat.reader.bulk set to true (the default).
Attaching a parquet file that reproduces the issue: test.parquet.
Query: select log, app_name from dfs.tmp.`test.parquet`
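Since the failure appears only when the bulk reader is enabled, a possible workaround (an assumption based on the observation above, not a confirmed fix) is to disable store.parquet.flat.reader.bulk for the session before running the query:

```sql
-- Assumed workaround: turn off the bulk flat reader for this session only.
ALTER SESSION SET `store.parquet.flat.reader.bulk` = false;

-- Re-run the failing query from this report.
SELECT log, app_name FROM dfs.tmp.`test.parquet`;
```

Disabling the bulk reader may slow down reads of variable-length columns, so it is best treated as a temporary measure.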