HIVE-7151: NULL keys should not be shuffled for inner equi joins

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 0.14.0
    • Fix Version/s: None
    • Component/s: Query Processor
    • Labels: None

    Description

      In shuffle joins, all rows with a NULL join key are sent to the same reducer. This heavily skews reducer load and leaves one slow reducer out of many.

      Rows with NULL keys can never satisfy the equality condition of an inner JOIN (unless the null-safe equality operator <=> is used), so they do not need to be shuffled at all.
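
      The effect can be illustrated with a toy simulation. This is only a sketch under stated assumptions, not Hive's implementation: the function names (partition, inner_equi_join, drop_null_keys), the sample data, and the reducer count are all hypothetical, and Python's built-in hash stands in for the shuffle partitioner.

```python
from collections import Counter


def partition(rows, num_reducers):
    """Count rows per reducer when partitioning by hash of the join key."""
    load = Counter()
    for key, _ in rows:
        # Assumption: every NULL (None) key hashes to the same partition.
        load[hash(key) % num_reducers] += 1
    return load


def inner_equi_join(left, right):
    """SQL-style inner equi join: NULL = NULL is not true, so None never matches."""
    index = {}
    for key, value in right:
        if key is not None:
            index.setdefault(key, []).append(value)
    return [
        (key, lv, rv)
        for key, lv in left
        if key is not None
        for rv in index.get(key, [])
    ]


def drop_null_keys(rows):
    """Filter applied before the shuffle: discard rows whose join key is NULL."""
    return [(key, value) for key, value in rows if key is not None]


# Hypothetical data: 1000 NULL-keyed rows plus 100 rows with distinct keys.
left = [(None, f"l{i}") for i in range(1000)] + [(i, f"l{i}") for i in range(100)]
right = [(i, f"r{i}") for i in range(50, 150)] + [(None, "r-null")]

skewed = partition(left, 10)                    # one reducer receives every NULL row
balanced = partition(drop_null_keys(left), 10)  # load after filtering NULL keys

joined_all = inner_equi_join(left, right)
joined_filtered = inner_equi_join(drop_null_keys(left), drop_null_keys(right))
```

      In this sketch the join output is the same with or without the NULL-keyed rows, while the busiest reducer's load drops sharply once they are filtered out. The filter would not be valid for a null-safe join, where NULL keys do match each other.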

            People

              Assignee: Unassigned
              Reporter: Gopal Vijayaraghavan (gopalv)