  1. Apache Drill
  2. DRILL-7760

REFRESH TABLE METADATA increases the planning time while querying over parquet files.

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.17.0
    • Fix Version/s: Future
    • Component/s: Metadata
    • Labels: None
    • Environment: Apache Drill Web UI (1.17.0)

    Description

      After running
      REFRESH TABLE METADATA <table>
      the planning time for a particular query increases from 35 seconds to more than 3 minutes.
      The Parquet file consists of 10M rows and 7 columns and is partitioned on 2 columns (each containing 8 unique values).
      What could explain this behaviour?
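
For anyone trying to reproduce this, the sequence described above might look roughly like the sketch below. The workspace and table path (dfs.tmp.`sales_parquet`), the column names, and the filter value are hypothetical placeholders that do not come from the report; only REFRESH TABLE METADATA and EXPLAIN PLAN FOR are standard Drill statements.

```sql
-- Hypothetical table path and column names; substitute the real
-- partitioned Parquet directory and the query that shows the slowdown.

-- 1. Plan the query before a metadata cache file exists and note the
--    planning time reported for it.
EXPLAIN PLAN FOR
SELECT col_a, COUNT(*) AS cnt
FROM dfs.tmp.`sales_parquet`
WHERE part_col1 = 'x'
GROUP BY col_a;

-- 2. Build the Parquet metadata cache for the table.
REFRESH TABLE METADATA dfs.tmp.`sales_parquet`;

-- 3. Plan the same query again and compare the planning time with step 1.
EXPLAIN PLAN FOR
SELECT col_a, COUNT(*) AS cnt
FROM dfs.tmp.`sales_parquet`
WHERE part_col1 = 'x'
GROUP BY col_a;
```

Comparing the two plans and their planning times side by side should make it easier to see whether the metadata cache is being read and whether partition pruning still applies.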

          People

            Assignee: Unassigned
            Reporter: Mukul Tewathia (mtewathia99)
            Votes: 0
            Watchers: 3
