Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-28410

Partition-Aware Optimization for Iceberg or OTF

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 4.0.0
    • None
    • None

    Description

      This project aims to allow users using Apache Iceberg or other non-native table formats to take advantage of Hive's advanced optimizations.

       

      Apache Hive provides some optimizations depending on the storage layout of Hive native tables. Bucket Map Join, Sort Merge Bucket Join, or GroupByOptimizer are some of them. Those optimizations are not open to non-native tables because they rely on a piece of hardcoded logic. For example, hashing algorithms are implemented outside StorageHandlers, so enabling Bucket Map Join on Iceberg tables with Bucket Transforms is unrealistic.

       

      We have some discussions in HIVE-27734; this is the first design doc.

      Attachments

        Activity

          People

            okumin Shohei Okumiya
            okumin Shohei Okumiya
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: