Details
- Type: Bug
- Status: Resolved
- Priority: Critical
- Resolution: Fixed
- 0.14.0
- None
- None
Description
In JoinStatsRule.process, the join column statistics are stored in the HashMap joinedColStats. The key used, ColStatistics.fqColName, is duplicated between join columns in the same vertex. As a result, distinctVals ends up containing duplicated values, which negatively affects the join cardinality estimation.
The duplicate keys are usually named KEY.reducesinkkey0.
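The collision described above can be illustrated with a minimal, self-contained sketch. This is not Hive's code: `ColStat`, `distinctValsKeyedByName`, and `distinctValsKeyedByPosition` are hypothetical names, the distinct counts are made-up illustrative values, and the position-qualified key is only one assumed way to disambiguate entries, not necessarily the fix that was applied.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class JoinKeyCollisionSketch {

    // Hypothetical stand-in for a column statistics record (not Hive's ColStatistics).
    static final class ColStat {
        final String fqColName;
        final long countDistinct;

        ColStat(String fqColName, long countDistinct) {
            this.fqColName = fqColName;
            this.countDistinct = countDistinct;
        }
    }

    // Keys the map by fqColName only: a later join column with the same
    // name overwrites the earlier one, so every lookup returns the same stats.
    static List<Long> distinctValsKeyedByName(List<ColStat> joinColumns) {
        Map<String, ColStat> joinedColStats = new HashMap<>();
        for (ColStat cs : joinColumns) {
            joinedColStats.put(cs.fqColName, cs);
        }
        List<Long> distinctVals = new ArrayList<>();
        for (ColStat cs : joinColumns) {
            distinctVals.add(joinedColStats.get(cs.fqColName).countDistinct);
        }
        return distinctVals;
    }

    // Assumed remedy for illustration: qualify the key with the join input
    // position so that identical column names no longer collide.
    static List<Long> distinctValsKeyedByPosition(List<ColStat> joinColumns) {
        Map<String, ColStat> joinedColStats = new HashMap<>();
        for (int pos = 0; pos < joinColumns.size(); pos++) {
            ColStat cs = joinColumns.get(pos);
            joinedColStats.put(pos + ":" + cs.fqColName, cs);
        }
        List<Long> distinctVals = new ArrayList<>();
        for (int pos = 0; pos < joinColumns.size(); pos++) {
            ColStat cs = joinColumns.get(pos);
            distinctVals.add(joinedColStats.get(pos + ":" + cs.fqColName).countDistinct);
        }
        return distinctVals;
    }

    public static void main(String[] args) {
        // Two join inputs whose key columns share the same name.
        List<ColStat> cols = Arrays.asList(
                new ColStat("KEY.reducesinkkey0", 1000L),
                new ColStat("KEY.reducesinkkey0", 10L));

        System.out.println(distinctValsKeyedByName(cols));     // [10, 10]
        System.out.println(distinctValsKeyedByPosition(cols)); // [1000, 10]
    }
}
```

With the name-only key, the first column's distinct count (1000) is lost and 10 is reported twice, which is the kind of duplicated distinctVals entry that skews a cardinality estimate based on those counts.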
Attachments
Issue Links
- is blocked by: HIVE-9512 HIVE-9327 causing regression in stats annotation (Closed)
- is related to: HIVE-8769 Physical optimizer : Incorrect CE results in a shuffle join instead of a Map join (PK/FK pattern not detected) (Resolved)
- relates to: HIVE-10107 Union All : Vertex missing stats resulting in OOM and in-efficient plans (Resolved)
- links to