Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-12200

Cap stats NDV from SetOperationStmt.createMetadata

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • Impala 4.3.0
    • Impala 4.3.0
    • Frontend
    • None
    • ghx-label-3

    Description

      Union operator will create merged ColumnStats at SetOperationStmt.createMetadata where it adds all ColumnStats from its input children. One of the stats being accumulated is NDV (num distinct value). There is an opportunity to lower the resulting NDV if all source expression is referring to the same column. This lower NDV can benefit Aggregation node on top of the Union node because it can lower cardinality and memory estimate of the Aggregation node.

      Attachments

        Activity

          People

            rizaon Riza Suminto
            rizaon Riza Suminto
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: