Apache Drill / DRILL-6331

Parquet filter pushdown does not support the native hive reader


Details

    Description

      Initially, HiveDrillNativeParquetGroupScan was based mainly on HiveScan; the core difference between them was
      that HiveDrillNativeParquetScanBatchCreator created a ParquetRecordReader instead of a HiveReader.
      This allowed Hive parquet files to be read with Drill's native parquet reader but did not expose Hive data to Drill optimizations
      such as filter push down, limit push down, and count to direct scan.
      The Hive code had to be refactored to use the same interfaces as ParquetGroupScan in order to be exposed to such optimizations.
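
      The idea of sharing one set of interfaces can be illustrated with a minimal Java sketch. It is not Drill's actual code: every class and method name below (FileStats, AbstractParquetScanSketch, applyFilter, applyLimit, countFromMetadata and the two subclasses) is hypothetical and chosen only for illustration. It assumes the optimizations are written once against a common base type, so a Hive-backed scan gets them simply by extending that type.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.function.Predicate;
import java.util.stream.Collectors;

/** Hypothetical per-file metadata: path, row count, min/max of one int column. */
final class FileStats {
    final String path;
    final long rowCount;
    final int minValue;
    final int maxValue;

    FileStats(String path, long rowCount, int minValue, int maxValue) {
        this.path = path;
        this.rowCount = rowCount;
        this.minValue = minValue;
        this.maxValue = maxValue;
    }
}

/** Hypothetical shared base type; the optimizations live here, once. */
abstract class AbstractParquetScanSketch {
    private final List<FileStats> files;

    AbstractParquetScanSketch(List<FileStats> files) {
        this.files = Collections.unmodifiableList(new ArrayList<>(files));
    }

    /** Each subclass rebuilds itself around a reduced file list. */
    protected abstract AbstractParquetScanSketch withFiles(List<FileStats> kept);

    List<FileStats> files() {
        return files;
    }

    /** Filter push down: drop files whose statistics rule out any match. */
    AbstractParquetScanSketch applyFilter(Predicate<FileStats> mayMatch) {
        return withFiles(files.stream().filter(mayMatch).collect(Collectors.toList()));
    }

    /** Limit push down: keep only enough files to cover maxRows rows. */
    AbstractParquetScanSketch applyLimit(long maxRows) {
        List<FileStats> kept = new ArrayList<>();
        long rows = 0;
        for (FileStats f : files) {
            if (rows >= maxRows) {
                break;
            }
            kept.add(f);
            rows += f.rowCount;
        }
        return withFiles(kept);
    }

    /** Count answered from metadata alone, without reading row data. */
    long countFromMetadata() {
        return files.stream().mapToLong(f -> f.rowCount).sum();
    }
}

/** Stand-in for a plain parquet scan. */
class PlainParquetScanSketch extends AbstractParquetScanSketch {
    PlainParquetScanSketch(List<FileStats> files) {
        super(files);
    }

    @Override
    protected AbstractParquetScanSketch withFiles(List<FileStats> kept) {
        return new PlainParquetScanSketch(kept);
    }
}

/** Stand-in for a Hive-backed scan; it inherits all three optimizations. */
class HiveParquetScanSketch extends AbstractParquetScanSketch {
    final String hiveTable;

    HiveParquetScanSketch(String hiveTable, List<FileStats> files) {
        super(files);
        this.hiveTable = hiveTable;
    }

    @Override
    protected AbstractParquetScanSketch withFiles(List<FileStats> kept) {
        return new HiveParquetScanSketch(hiveTable, kept);
    }
}
```

      In this sketch, calling applyFilter with a predicate such as f -> f.maxValue >= 15 on a HiveParquetScanSketch returns another HiveParquetScanSketch that holds only the files whose statistics could satisfy the predicate. The Hive-specific class carries its table name along but contains no optimization logic of its own.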

      Attachments

        Activity


          People

            arina Arina Ielchiieva
            arina Arina Ielchiieva
            Vitalii Diravka Vitalii Diravka
            Votes: 0
            Watchers: 5

            Dates

              Created:
              Updated:
              Resolved:
