  1. Hive
  2. HIVE-24566

Add Parquet Stats Optimization

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved

    Description

      Parquet files store per-column min/max values and row counts in the file footer metadata.

      When a query is submitted against a Parquet table and table statistics are not available, Hive should launch a single multi-threaded processor that reads only the footer metadata of each Parquet file, instead of scanning every record in the table.
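
      The idea can be sketched roughly as follows. This is only an illustration in Python, not Hive code: `FooterStats`, `merge`, and `collect_table_stats` are hypothetical names, the statistics cover a single integer column, and the footer reader is passed in as a function so the sketch runs without any Parquet library. In a real setup that function might wrap something like `pyarrow.parquet.read_metadata`, or parquet-mr's `ParquetFileReader` on the Java side.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass(frozen=True)
class FooterStats:
    """Hypothetical summary of one column, as read from one file's footer."""
    row_count: int
    min_value: Optional[int]  # None when the footer carries no min/max
    max_value: Optional[int]


def merge(a: FooterStats, b: FooterStats) -> FooterStats:
    """Combine two summaries; an empty file contributes nothing."""
    if a.row_count == 0:
        return b
    if b.row_count == 0:
        return a
    # If any non-empty file lacks min/max, the combined bounds are unknown.
    known = None not in (a.min_value, a.max_value, b.min_value, b.max_value)
    return FooterStats(
        a.row_count + b.row_count,
        min(a.min_value, b.min_value) if known else None,
        max(a.max_value, b.max_value) if known else None,
    )


def collect_table_stats(
    paths: Iterable[str],
    read_footer: Callable[[str], FooterStats],
    max_workers: int = 8,
) -> FooterStats:
    """Read every file's footer on a thread pool and fold the results."""
    total = FooterStats(0, None, None)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Footer reads are I/O bound, so threads overlap the waits.
        for stats in pool.map(read_footer, paths):
            total = merge(total, stats)
    return total


# Stand-in for real footers: file name -> statistics (assumed values).
fake_footers = {
    "part-0.parquet": FooterStats(100, 3, 40),
    "part-1.parquet": FooterStats(250, -7, 12),
    "part-2.parquet": FooterStats(0, None, None),
}
result = collect_table_stats(fake_footers, fake_footers.__getitem__)
print(result)  # FooterStats(row_count=350, min_value=-7, max_value=40)
```

      Treating missing min/max in any non-empty file as "unknown" for the whole table is a conservative choice in this sketch; an actual implementation could instead fall back to scanning only the files whose footers lack statistics.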

          People

            Assignee: Unassigned
            Reporter: David Mollitor (belugabehr)
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated: