Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-23907

Hash table type should be considered for calculating the Map join table size

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      For some join like Anti join and Semi join , hash set is used instead of a hash table. This is done as these joins do not emit the right side columns and just an existence check is enough for join.  When we check for the  table size , during map join conversion , this info is not considered. The hash table size for these join will be considerably small and thus hash table for bigger table can fit into memory.

       

      Attachments

        Issue Links

          Activity

            People

              maheshk114 mahesh kumar behera
              maheshk114 mahesh kumar behera
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: