Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-7760

REFRESH TABLE METADATA increases the planning time while querying over parquet files.

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.17.0
    • Fix Version/s: Future
    • Component/s: Metadata
    • Labels:
      None
    • Environment:

      Apache Drill Web UI ( 1.170.0 )

      Description

      After running
      REFRESH TABLE METADATA <table>
      The planning time increases for a particular query increases from 35 secs to more than 3 minutes.
      Parquet file consist of 10M rows and 7 columns and is partitioned on 2 columns ( both the column contains 8 unique values ).
      What could be the possible explanation for this behaviour ?

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              mtewathia99 Mukul Tewathia
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated: