Apache Drill / DRILL-8486

ParquetDecodingException: could not read bytes at offset

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version: 1.21.1
    • Fix Version: 1.21.2
    • Component: Storage - Parquet
    • Labels: None

    Description

      Drill fails to read a Parquet file and throws the following exception:

      Caused by: org.apache.parquet.io.ParquetDecodingException: could not read bytes at offset 591804
      	at org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:42)
      	at org.apache.drill.exec.store.parquet.columnreaders.VarLenColumnBulkInput$ValuesReaderWrapper.getNextEntry(VarLenColumnBulkInput.java:754)
      	... 43 common frames omitted
      Caused by: java.io.EOFException: null
      	at org.apache.parquet.bytes.SingleBufferInputStream.read(SingleBufferInputStream.java:52)
      	at org.apache.parquet.bytes.BytesUtils.readIntLittleEndian(BytesUtils.java:83)
      	at org.apache.parquet.column.values.plain.BinaryPlainValuesReader.readBytes(BinaryPlainValuesReader.java:39)
      	... 44 common frames omitted

      This issue only affects queries run with store.parquet.flat.reader.bulk set to true (the default).

      A Parquet file that reproduces the issue is attached: test.parquet.

      Query: select log, app_name from dfs.tmp.`test.parquet`
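
Since the failure is reported only when the bulk reader is enabled, one way to confirm this (and possibly to work around it on an affected version) is to turn the option off for the session and rerun the same query. This is a minimal sketch: it assumes test.parquet has been copied into the dfs.tmp workspace, and the expectation that the query then succeeds is inferred from the statement above rather than verified separately.

```sql
-- Disable the bulk reader for this session only
-- (option name taken from the description above).
ALTER SESSION SET `store.parquet.flat.reader.bulk` = false;

-- Rerun the query from the report.
SELECT log, app_name FROM dfs.tmp.`test.parquet`;

-- Restore the session option to its default afterwards.
ALTER SESSION RESET `store.parquet.flat.reader.bulk`;
```

Scoping the change with ALTER SESSION rather than ALTER SYSTEM keeps it from affecting other users of the same Drill cluster.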

      Attachments

        1. test.parquet (6.16 MB, Maksym Rymar)

          People

            Assignee: Maksym Rymar (rymarm)
            Reporter: Maksym Rymar (rymarm)
