  1. Apache Drill
  2. DRILL-6100

Intermittent failure while reading Parquet file footer during planning phase


Details

    Description

      When multiple users run queries against a view that refers to a Parquet data file, intermittent failures are seen during the planning phase. The failure happens when the Parquet data file, which the view owner has access to, is read to create the metadata cache. The query user doesn't have direct access to the Parquet data file but has read access to the view, which in turn accesses the actual data. When the Parquet metadata file is created, it is created as the process user (per DRILL-4143), but the footer is not read under the process user context. While running concurrent queries from several clients, sporadic failures were observed, because at times the footer was being read as the query user, which doesn't have access to the file.
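
      The failure mode described above can be modelled with a minimal, JDK-only sketch. It is not Drill's code: the class FooterReadSketch, the thread-local CURRENT_USER, the helper runAs, and the user names drill_process, view_owner, and query_user are all hypothetical. In Hadoop-based code the analogous mechanism is typically UserGroupInformation.doAs, which is deliberately not used here so the example runs without extra dependencies. The sketch only illustrates why a footer read that inherits the caller's identity can be denied, while one pinned to the process user is not.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;
import java.util.concurrent.Callable;

public class FooterReadSketch {
    // Hypothetical name for the user the server process runs as.
    static final String PROCESS_USER = "drill_process";

    // Hypothetical stand-in for a per-thread security context.
    static final ThreadLocal<String> CURRENT_USER =
            ThreadLocal.withInitial(() -> PROCESS_USER);

    // Hypothetical ACL: only the process user and the view owner may open the data file.
    static final Set<String> READERS =
            new HashSet<>(Arrays.asList(PROCESS_USER, "view_owner"));

    // Runs the action with the given user active on this thread, then restores the previous user.
    static <T> T runAs(String user, Callable<T> action) throws Exception {
        String previous = CURRENT_USER.get();
        CURRENT_USER.set(user);
        try {
            return action.call();
        } finally {
            CURRENT_USER.set(previous);
        }
    }

    // Simulated footer read: the permission check uses whichever user is active on the thread.
    static String readFooter(String path) {
        String user = CURRENT_USER.get();
        if (!READERS.contains(user)) {
            throw new SecurityException("Open failed for file: " + path + " as " + user);
        }
        return "footer(" + path + ")";
    }

    // Problematic variant: the footer read inherits the caller's identity.
    static String gatherMetadataUnsafe(String path) {
        return readFooter(path);
    }

    // Pinned variant: the footer read always runs as the process user.
    static String gatherMetadataAsProcessUser(String path) throws Exception {
        return runAs(PROCESS_USER, () -> readFooter(path));
    }

    public static void main(String[] args) throws Exception {
        String path = "/env/test/data/final";

        // Query user goes through the pinned variant: the read succeeds.
        System.out.println(runAs("query_user", () -> gatherMetadataAsProcessUser(path)));

        // Query user goes through the inheriting variant: the read is denied.
        try {
            runAs("query_user", () -> gatherMetadataUnsafe(path));
        } catch (SecurityException e) {
            System.out.println("denied: " + e.getMessage());
        }
    }
}
```

      In this model the outcome of the unpinned read depends entirely on which identity happens to be active when the read executes, which is consistent with the sporadic nature of the failure reported here.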

       

      2018-01-12 13:19:57,267
      Caused by: org.apache.drill.common.exceptions.DrillRuntimeException: Failure creating scan.
       at org.apache.drill.exec.planner.logical.DrillScanRel.<init>(DrillScanRel.java:92) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.planner.logical.DrillScanRel.<init>(DrillScanRel.java:70) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.planner.logical.DrillScanRel.<init>(DrillScanRel.java:63) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.planner.logical.DrillScanRule.onMatch(DrillScanRule.java:37) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.calcite.plan.volcano.VolcanoRuleCall.onMatch(VolcanoRuleCall.java:228) ~[calcite-core-1.4.0-drill-r21.jar:1.4.0-drill-r21]
       ... 15 common frames omitted
       Caused by: org.apache.hadoop.security.AccessControlException: Open failed for file: /env/test/data/final
       at com.mapr.fs.MapRClientImpl.open(MapRClientImpl.java:265) ~[maprfs-5.2.2-mapr.jar:5.2.2-mapr]
       at com.mapr.fs.MapRFileSystem.open(MapRFileSystem.java:938) ~[maprfs-5.2.2-mapr.jar:5.2.2-mapr]
       at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:803) ~[hadoop-common-2.7.0-mapr-1607.jar:na]
       at org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:425) ~[parquet-hadoop-1.8.1-drill-r0.jar:1.8.1-drill-r0]
       at org.apache.parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:412) ~[parquet-hadoop-1.8.1-drill-r0.jar:1.8.1-drill-r0]
       at org.apache.drill.exec.store.parquet.Metadata.getParquetFileMetadata_v3(Metadata.java:395) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.Metadata.access$000(Metadata.java:85) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.Metadata$MetadataGatherer.runInner(Metadata.java:323) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.Metadata$MetadataGatherer.runInner(Metadata.java:311) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.TimedRunnable.run(TimedRunnable.java:56) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.TimedRunnable.run(TimedRunnable.java:122) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.Metadata.getParquetFileMetadata_v3(Metadata.java:285) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.Metadata.getParquetTableMetadata(Metadata.java:264) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.Metadata.getParquetTableMetadata(Metadata.java:249) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.Metadata.getParquetTableMetadata(Metadata.java:121) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.ParquetGroupScan.init(ParquetGroupScan.java:733) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.ParquetGroupScan.<init>(ParquetGroupScan.java:230) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.ParquetGroupScan.<init>(ParquetGroupScan.java:190) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.ParquetFormatPlugin.getGroupScan(ParquetFormatPlugin.java:169) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.parquet.ParquetFormatPlugin.getGroupScan(ParquetFormatPlugin.java:67) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.dfs.FileSystemPlugin.getPhysicalScan(FileSystemPlugin.java:146) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.store.AbstractStoragePlugin.getPhysicalScan(AbstractStoragePlugin.java:100) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.planner.logical.DrillTable.getGroupScan(DrillTable.java:85) ~[drill-java-exec-1.10.0.jar:1.10.0]
       at org.apache.drill.exec.planner.logical.DrillScanRel.<init>(DrillScanRel.java:90) ~[drill-java-exec-1.10.0.jar:1.10.0]
       ... 19 common frames omitted
       org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: AccessControlException: Open failed for file: /env/test/data/final/snapshot_period_id=1234567/000000_0, error: Permission denied (13)
      

       


            People

              shamirwasia Sorabh Hamirwasia
              shamirwasia Sorabh Hamirwasia
              Parth Chandra Parth Chandra