Details
-
Bug
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
Impala 4.0.0
-
None
Description
I believe there are memory leaks in the aggregate UDFs that use Apache DataSketches. Sketch or union objects are constructed in a buffer to hold the state of the aggregation, and that buffer is later deallocated without calling the destructors of those objects, so any memory the objects allocated internally is never released.
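To illustrate the leak pattern, here is a minimal sketch that assumes the state object is built with placement new inside a raw buffer. `FakeSketch`, `InitState` and `FreeState` are hypothetical names for illustration only; they are not Impala or DataSketches code, and `std::malloc`/`std::free` stand in for whatever allocator the UDF framework provides.

```cpp
#include <cassert>
#include <cstdlib>
#include <new>
#include <vector>

// Hypothetical stand-in for a sketch or union object (not a DataSketches type).
// It owns heap memory through std::vector, as a real sketch typically would.
struct FakeSketch {
  static int live_instances;  // counts objects whose destructor has not run yet
  std::vector<int> data;
  FakeSketch() : data(1024, 0) { ++live_instances; }
  ~FakeSketch() { --live_instances; }
};
int FakeSketch::live_instances = 0;

// Init step: construct the object inside a raw buffer with placement new.
FakeSketch* InitState() {
  void* buffer = std::malloc(sizeof(FakeSketch));
  if (buffer == nullptr) return nullptr;
  return new (buffer) FakeSketch();
}

// Cleanup step: run the destructor explicitly, then release the buffer.
// Calling only std::free(state) would skip ~FakeSketch() and leak the
// vector's heap storage, which is the problem described above.
void FreeState(FakeSketch* state) {
  if (state == nullptr) return;
  state->~FakeSketch();
  std::free(state);
}
```

Freeing the buffer releases only the bytes of the object itself. Memory the object allocated on its own is reachable only through its destructor, which is why the explicit `state->~FakeSketch()` call is needed before the buffer goes away.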
There is also unnecessary overhead during the merge phase of some of these UDFs: a new union instance is created for every merge operation and finalized each time with a fairly expensive get_result() call. This second issue is a performance concern, not a bug.
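The merge overhead can be illustrated with a toy model. `ToySketch`, `ToyUnion` and both merge functions below are hypothetical and only mimic the shape of the problem; the real DataSketches classes have different interfaces. Keeping one union alive across merges is an assumed alternative for comparison, not necessarily the fix that was applied.

```cpp
#include <cassert>
#include <set>
#include <vector>

// Hypothetical stand-ins: a "sketch" is just a set of ints here.
using ToySketch = std::set<int>;

struct ToyUnion {
  static int get_result_calls;  // counts how often the costly step runs
  std::set<int> items;
  void update(const ToySketch& s) { items.insert(s.begin(), s.end()); }
  ToySketch get_result() const {
    ++get_result_calls;
    return items;
  }
};
int ToyUnion::get_result_calls = 0;

// Pattern described above: a fresh union and a get_result() call per merge.
ToySketch MergePerCall(const std::vector<ToySketch>& inputs) {
  ToySketch state;
  for (const ToySketch& in : inputs) {
    ToyUnion u;
    u.update(state);
    u.update(in);
    state = u.get_result();
  }
  return state;
}

// Assumed alternative: keep one union as the intermediate state and
// call get_result() once, when the final value is needed.
ToySketch MergeWithPersistentUnion(const std::vector<ToySketch>& inputs) {
  ToyUnion u;
  for (const ToySketch& in : inputs) u.update(in);
  return u.get_result();
}
```

Both functions produce the same merged result, but the first calls `get_result()` once per input while the second calls it once in total.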