[ARROW-13451] [C++][Compute] Consider removing ScalarAggregateKernel - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: None
Fix Version/s: None
Component/s: C++
Labels:
- pull-request-available

External issue URL:
https://github.com/apache/arrow/issues/29116

Description

Scalar aggregation does not incur large memory overhead for the associated KernelState objects, so maybe it'd be acceptable to remove explicit scalar aggregation kernels in favor of reusing grouped aggregation kernels with a single group. This would decrease our maintenance burden significantly, and if the benchmarks don't show a regression for single-group aggregation then there's no reason not to.

Even if there is a performance regression we could bundle the scalar and grouped aggregate kernels in the same compute::Function and decide between them in Dispatch*, rather than confusingly defining distinct "sum" and "hash_sum" functions

Attachments

Issue Links

links to

GitHub Pull Request #10813

Activity

People

Assignee:: Unassigned

Reporter:: Ben Kietzman

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 26/Jul/21 19:02

Updated:: 11/Jan/23 08:33

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

2h 10m