Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-8021

Add estimated cardinality to EXPLAIN output

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • Impala 3.1.0
    • Impala 3.2.0
    • Frontend
    • None
    • ghx-label-2

    Description

      The EXPLAIN output provides much useful information in the plan tree. All our planning decisions are based on cardinality; but it appears in the EXPLAIN output only for the EXTENDED level. The profile only contains the plan from the STANDARD level. This change proposes to include row size and cardinality even in the STANDARD level.

      The nodes that have the information call it "cardinality", so continue to use that term.

      Add cardinality to each node so it appears something like this:

      HASH JOIN [INNER JOIN, BROADCAST]
      |  row-size=89B cardinality=1.23G
      |
      |--SCAN HDFS [db.table]
      |     partitions=2/123 files=2 size=4.56MB row-size=89B cardinality=7.89M
      

      Cardinality should appear in all levels above MINIMAL. Cardinality is not needed for EXCHANGE since it can be inferred from other nodes.

      Also, the existing code prints large cardinalities in detail: 1234567890, which is hard to read. Use the abbreviated output, using metric (power of 1000) units, so 1.23G instead.

      Attachments

        Issue Links

          Activity

            People

              Paul.Rogers Paul Rogers
              Paul.Rogers Paul Rogers
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: