Description
It would be useful if we can identify hash map collision issues early on.
We should add avg hash map probe metric to aggregate operator and hash join operator and report them. If the avg probe is greater than a specific (configurable) threshold, we should log an error at runtime.
The primary classes to look at are UnsafeFixedWidthAggregationMap, HashAggregateExec, HashedRelation, HashJoin.
Attachments
Issue Links
1.
|
Add hash map metrics to aggregate | Resolved | L. C. Hsieh | |
2.
|
Add hash map metrics to join | Resolved | L. C. Hsieh |