Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-7238

Drill does not use DirectScan for non-existent columns

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.17.0
    • Component/s: Metadata
    • Labels:

      Description

      This query does not use the summary metadata cache file:
      select count(int_nulls_id), count(int_id), count(ss_ticket_number), count(extra) from store_sales_null_blocks_int;

      In this query, extra is a column that does not exist (non-existent column).

      Here is the explain plan:

      | 00-00    Screen
      00-01      Project(EXPR$0=[$0], EXPR$1=[$1], EXPR$2=[$2], EXPR$3=[$3])
      00-02        StreamAgg(group=[{}], EXPR$0=[$SUM0($0)], EXPR$1=[$SUM0($1)], EXPR$2=[$SUM0($2)], EXPR$3=[$SUM0($3)])
      00-03          UnionExchange
      01-01            StreamAgg(group=[{}], EXPR$0=[COUNT($0)], EXPR$1=[COUNT($1)], EXPR$2=[COUNT($2)], EXPR$3=[COUNT($3)])
      01-02              Scan(table=[[dfs, parquet_metadata_cache, store_sales_null_blocks_int]], groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/drill/testdata/metadata_cache/store_sales_null_blocks_int]], selectionRoot=/drill/testdata/metadata_cache/store_sales_null_blocks_int, numFiles=1, numRowGroups=11, usedMetadataFile=true, cacheFileRoot=/drill/testdata/metadata_cache/store_sales_null_blocks_int, columns=[`int_nulls_id`, `int_id`, `ss_ticket_number`, `extra`]]])
      

      This is a regression from Drill 1.15.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                vdonapati Venkata Jyothsna Donapati
                Reporter:
                vdonapati Venkata Jyothsna Donapati
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: