Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-5364

Make sure Hudi's Column Stats are wired into Spark's relation stats

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Critical
    • Resolution: Unresolved
    • 0.12.1
    • 1.1.0
    • spark, spark-sql
    • None
    • 4

    Description

      Currently, we're leveraging CSI exclusively to better prune the target files.

      Additionally, we should wire in stats from CSI into Spark's `CatalogStatistics` which in turn will be leveraged by Spark's Optimization rules for better planning.

      Attachments

        Activity

          People

            alexey.kudinkin Alexey Kudinkin
            alexey.kudinkin Alexey Kudinkin
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: