Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
Reviewed
Description
In GroupByOperator, objects are first deep copied and then check whether or not the object is in the hash table (in hash-mode aggregation). In fact, object deep copy could be very expensive (around 5% CPU time). A simple change could be generate the object without deep copy through ObjectInspector and check its existence in the hash table. If not exists, we call deep copy.
Attachments
Attachments
Issue Links
- blocks
-
HIVE-535 Memory-efficient hash-based Aggregation
- Open