Details
- Type: Bug
- Status: Resolved
- Priority: Major
- Resolution: Duplicate
- 0.14.0
- None
- None
Description
NULL join keys cause heavy reducer skew in shuffle joins: every row with a NULL key is routed to the same reducer, which leaves one slow reducer out of many.
These NULL keys can never satisfy the equality condition of an inner JOIN (unless the null-safe operator <=> is used), so shuffling them is wasted work.
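The skew described above can be illustrated with a toy simulation. This is a minimal sketch and not Hive's actual partitioner: `partition`, `reducer_load`, and `inner_join` are hypothetical helpers, the data is made up, and the only assumption carried over from the report is that all NULL keys land on a single reducer. It shows that dropping NULL keys before the shuffle evens out the load without changing the inner join result.

```python
def partition(key, num_reducers):
    """Toy stand-in for a shuffle partitioner (assumption: NULL keys all go to reducer 0)."""
    if key is None:
        return 0
    return hash(key) % num_reducers


def reducer_load(keys, num_reducers):
    """Return a list with the number of rows each reducer would receive."""
    load = [0] * num_reducers
    for key in keys:
        load[partition(key, num_reducers)] += 1
    return load


def inner_join(left, right):
    """Inner equi-join on the first tuple field; NULL keys never match, as with SQL '='."""
    index = {}
    for key, value in right:
        if key is not None:
            index.setdefault(key, []).append(value)
    return [
        (key, lval, rval)
        for key, lval in left
        if key is not None
        for rval in index.get(key, [])
    ]


# Hypothetical data: 1000 NULL-keyed rows plus 100 distinct integer keys.
left = [(None, "x")] * 1000 + [(i, "l") for i in range(100)]
right = [(i, "r") for i in range(100)]

keys = [key for key, _ in left]
skewed = reducer_load(keys, 4)
filtered = reducer_load([key for key in keys if key is not None], 4)

print(skewed)    # [1025, 25, 25, 25]
print(filtered)  # [25, 25, 25, 25]
```

Filtering on `key is not None` before the shuffle plays the role of an `IS NOT NULL` predicate on the join source; the joined rows are identical either way because NULL keys never match under plain equality.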
Issue Links
- is superseded by
- HIVE-7159 For inner joins push a 'is not null predicate' to the join sources for every non nullSafe join condition (Closed)