Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-28410

Partition-Aware Optimization for Iceberg or OTF

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 4.0.0
    • None
    • None

    Description

      This project aims to allow users using Apache Iceberg or other non-native table formats to take advantage of Hive's advanced optimizations.

       

      Apache Hive provides some optimizations depending on the storage layout of Hive native tables. Bucket Map Join, Sort Merge Bucket Join, or GroupByOptimizer are some of them. Those optimizations are not open to non-native tables because they rely on a piece of hardcoded logic. For example, hashing algorithms are implemented outside StorageHandlers, so enabling Bucket Map Join on Iceberg tables with Bucket Transforms is unrealistic.

       

      We have some discussions in HIVE-27734; this is the first design doc.

      Attachments

        Issue Links

          Activity

            People

              okumin Shohei Okumiya
              okumin Shohei Okumiya
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: