IMPALA-11518

Use Iceberg stats for Table Stats if there's no stats in HMS

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Component/s: Frontend

    Description

      Currently we use only HMS (Hive Metastore) stats for table stats, even for Iceberg tables:
      https://github.com/apache/impala/blob/14b9fb97b59de1749fd93b0469ff8ea6f030021d/fe/src/main/java/org/apache/impala/catalog/local/LocalTable.java#L163
      https://github.com/apache/impala/blob/master/fe/src/main/java/org/apache/impala/catalog/Table.java#L413

      IcebergTable and LocalIcebergTable should set 'tableStats_' based on Iceberg metadata.

      For tables without delete files, we should always use Iceberg stats, because they are precise whereas HMS stats can be stale.

      For tables with delete files, the HMS stats might be more precise, as they take the deleted rows into account.
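
The selection rule described above could be sketched roughly as follows. This is only an illustration, not Impala code: the class `IcebergRowCountSketch`, the methods `chooseNumRows` and `readLong`, and the use of -1 for "unknown" are hypothetical names and conventions chosen for the example. It assumes the Iceberg snapshot summary is available as a string map with the `total-records` and `total-delete-files` properties, and that falling back to HMS when the Iceberg summary is missing is acceptable.

```java
import java.util.Map;

/** Hypothetical helper for illustration only; not part of Impala. */
final class IcebergRowCountSketch {
    /** Marker for "row count not known" (an assumption of this sketch). */
    static final long UNKNOWN = -1L;

    /** Reads a non-negative long from the summary, or returns dflt if absent or malformed. */
    static long readLong(Map<String, String> summary, String key, long dflt) {
        String raw = summary.get(key);
        if (raw == null) return dflt;
        try {
            long value = Long.parseLong(raw.trim());
            return value >= 0 ? value : dflt;
        } catch (NumberFormatException e) {
            return dflt;
        }
    }

    /**
     * Picks the row count to use for table stats.
     * hmsNumRows: row count from HMS, negative if HMS has no stats.
     * summary: Iceberg snapshot summary properties (assumed input shape).
     */
    static long chooseNumRows(long hmsNumRows, Map<String, String> summary) {
        long icebergRows = readLong(summary, "total-records", UNKNOWN);
        long deleteFiles = readLong(summary, "total-delete-files", 0L);
        boolean hmsKnown = hmsNumRows >= 0;

        // No usable Iceberg count: keep whatever HMS has, if anything.
        if (icebergRows == UNKNOWN) return hmsKnown ? hmsNumRows : UNKNOWN;

        // No delete files: the Iceberg count is precise, HMS may be stale.
        if (deleteFiles == 0) return icebergRows;

        // Delete files present: HMS may account for deleted rows, so prefer it;
        // otherwise fall back to the Iceberg count, which may overcount.
        return hmsKnown ? hmsNumRows : icebergRows;
    }

    public static void main(String[] args) {
        Map<String, String> noDeletes =
            Map.of("total-records", "100", "total-delete-files", "0");
        Map<String, String> withDeletes =
            Map.of("total-records", "100", "total-delete-files", "2");
        System.out.println(chooseNumRows(80, noDeletes));
        System.out.println(chooseNumRows(80, withDeletes));
        System.out.println(chooseNumRows(UNKNOWN, withDeletes));
    }
}
```

Whether HMS stats should really win whenever delete files exist is left open in the description ("might be more precise"), so the last branch is one possible policy rather than a settled decision.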

          People

            Assignee: Unassigned
            Reporter: Zoltán Borók-Nagy (boroknagyz)