Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-5119

MapJoin & Partition Pruning (MapJoin can take advantage of materialized data to prune partitions of big table)

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.11.0
    • None
    • Query Processor
    • None

    Description

      Map-Join predicates where the joining columns from big table (streamed table) are partition columns and corresponding columns from small table is not partitioned, the join would not prune the unnecessary partitions from big table. Since data for all small tables is materialized before big table is streamed, theoretically it would be possible to prune the unnecessary partitions from big table.

      Proposal document is at https://cwiki.apache.org/confluence/x/sgkHAg

      Attachments

        Issue Links

          Activity

            People

              jpullokkaran Laljo John Pullokkaran
              jpullokkaran Laljo John Pullokkaran
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: