Hive / HIVE-16328

HoS: more aggressive mapjoin optimization when hive.spark.use.ts.stats.for.mapjoin is true

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.0.0
    • Component/s: Logical Optimizer
    • Labels: None

    Description

      Since HIVE-15489, when hive.spark.use.ts.stats.for.mapjoin is set to true and the JOIN operator has any upstream RS (ReduceSink) operator, we no longer convert the JOIN operator to a MAPJOIN operator.

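      However, this is overly conservative. A better solution is to treat the branch that has an upstream RS as the big table, and then check that all other branches are map-only (no upstream RS) AND can fit within the hash table size. If both conditions hold, the JOIN can still be converted to a MAPJOIN.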
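
      The proposed check can be sketched roughly as follows. This is an illustrative sketch only, not the code from the attached patches: the class, the Branch type, and the maxHashTableBytes parameter are hypothetical names, and it assumes that "fit in hash table size" means the combined estimated size of the small-table branches is compared against a single threshold. It also assumes that two or more branches with an upstream RS rule out conversion, since only one branch can be the big table.

```java
import java.util.List;

// Illustrative sketch; all names here are hypothetical, not Hive APIs.
final class MapJoinBigTableSketch {

    /** Hypothetical summary of one input branch of a JOIN operator. */
    static final class Branch {
        final boolean hasUpstreamReduceSink; // false means the branch is map-only
        final long estimatedSizeBytes;       // size estimate taken from table-scan stats

        Branch(boolean hasUpstreamReduceSink, long estimatedSizeBytes) {
            this.hasUpstreamReduceSink = hasUpstreamReduceSink;
            this.estimatedSizeBytes = estimatedSizeBytes;
        }
    }

    /**
     * Returns the index of the branch to use as the big table, or -1 if the
     * JOIN should not be converted by this rule.
     */
    static int chooseBigTable(List<Branch> branches, long maxHashTableBytes) {
        int bigTable = -1;
        long smallTablesTotal = 0L;
        for (int i = 0; i < branches.size(); i++) {
            Branch b = branches.get(i);
            if (b.hasUpstreamReduceSink) {
                if (bigTable != -1) {
                    return -1; // assumption: at most one branch may have an upstream RS
                }
                bigTable = i;  // the branch with an upstream RS becomes the big table
            } else {
                smallTablesTotal += b.estimatedSizeBytes;
            }
        }
        if (bigTable == -1) {
            return -1; // no upstream RS at all: outside the case this sketch covers
        }
        // All remaining branches are map-only; they must fit in the hash table.
        return smallTablesTotal <= maxHashTableBytes ? bigTable : -1;
    }
}
```

      For example, with one branch that has an upstream RS and one map-only branch whose estimated size is below the threshold, the sketch picks the RS branch as the big table. If the map-only branch is too large, or a second branch also has an upstream RS, it declines the conversion.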

      Attachments

        1. HIVE-16328.1.patch
          3 kB
          Chao Sun
        2. HIVE-16328.2.patch
          5 kB
          Chao Sun

            People

              Assignee: Chao Sun (csun)
              Reporter: Chao Sun (csun)
              Votes: 0
              Watchers: 5
