Apache Drill / DRILL-5009

Query with a simple join fails on Hive generated parquet

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 1.9.0
    • Fix Version/s: 1.9.0
    • Component/s: Storage - Parquet
    • Labels:
      None
    • Environment:

      Commit ID: 5a439424594eb10d113163eaa1fdf8034f387235c
      1.9.0 SNAPSHOT - Nov 5 2016

      Description

      Query:

      SELECT *
      FROM store_sales ss, customer c
      WHERE  ss.ss_customer_sk = c.c_customer_sk 
      LIMIT 1; 
      

      Error:

      Error: SYSTEM ERROR: IOException: End of stream reached while initializing buffered reader.
      
      Fragment 2:0
      
      [Error Id: 93726aea-1d62-4e7c-a2bf-1d7cc1e834e4 on abhi1:31010]
      
        (org.apache.drill.common.exceptions.DrillRuntimeException) Error in parquet record reader.
      ...
      ...
       Caused By (org.apache.drill.common.exceptions.ExecutionSetupException) Error opening or reading metadata for parquet file at location: customer.parquet
          org.apache.drill.exec.store.parquet.columnreaders.PageReader.<init>():145
          org.apache.drill.exec.store.parquet.columnreaders.AsyncPageReader.<init>():59
          org.apache.drill.exec.store.parquet.columnreaders.ColumnReader.<init>():96
          org.apache.drill.exec.store.parquet.columnreaders.NullableColumnReader.<init>():39
          org.apache.drill.exec.store.parquet.columnreaders.NullableFixedByteAlignedReaders$NullableFixedByteAlignedReader.<init>():58
          org.apache.drill.exec.store.parquet.columnreaders.ColumnReaderFactory.getNullableColumnReader():252
          org.apache.drill.exec.store.parquet.columnreaders.ColumnReaderFactory.createFixedColumnReader():186
          org.apache.drill.exec.store.parquet.columnreaders.ParquetRecordReader.setup():402
          org.apache.drill.exec.physical.impl.ScanBatch.next():212
          org.apache.drill.exec.record.AbstractRecordBatch.next():119
          org.apache.drill.exec.record.AbstractRecordBatch.next():109
          org.apache.drill.exec.record.AbstractSingleRecordBatch.innerNext():51
          org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext():135
          org.apache.drill.exec.record.AbstractRecordBatch.next():162
          org.apache.drill.exec.physical.impl.BaseRootExec.next():104
          org.apache.drill.exec.physical.impl.broadcastsender.BroadcastSenderRootExec.innerNext():95
          org.apache.drill.exec.physical.impl.BaseRootExec.next():94
          org.apache.drill.exec.work.fragment.FragmentExecutor$1.run():232
          org.apache.drill.exec.work.fragment.FragmentExecutor$1.run():226
          java.security.AccessController.doPrivileged():-2
          javax.security.auth.Subject.doAs():415
          org.apache.hadoop.security.UserGroupInformation.doAs():1595
          org.apache.drill.exec.work.fragment.FragmentExecutor.run():226
          org.apache.drill.common.SelfCleaningRunnable.run():38
          java.util.concurrent.ThreadPoolExecutor.runWorker():1145
          java.util.concurrent.ThreadPoolExecutor$Worker.run():615
          java.lang.Thread.run():745
      ...
      

      Log attached.

        Attachments

        1. DRILL-5009.log.txt
          1.84 MB
          Abhishek Girish

              People

              • Assignee: Parth Chandra (parthc)
              • Reporter: Abhishek Girish (agirish)
              • Votes: 0
              • Watchers: 5