Details
-
Improvement
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
None
Description
Another aggregate function we’ll need to look at is median().
That should in theory map to the existing quantile kernel and the hopefully-coming-soon hash_quantile kernel with q = 0.5, but that will not work right now for several reasons (see https://issues.apache.org/jira/browse/ARROW-13309).
However we also have a pair of
approximate quantile kernels: tdigest and hash_tdigest that we could map median() to.
Attachments
Issue Links
- blocks
-
ARROW-13618 [R] Use Arrow engine for summarize() by default
-
- Resolved
-
- depends upon
-
ARROW-13520 [C++] Implement hash_aggregate approximate quantile kernel
-
- Resolved
-
- is blocked by
-
ARROW-12669 [C++] Kernel to return Array of elements at index of list in ListArray
-
- Resolved
-
-
ARROW-14050 [C++] tdigest, quantile return empty arrays when nulls not skipped
-
- Resolved
-
-
ARROW-14052 [C++] Add appx_median, hash_appx_median functions
-
- Resolved
-
-
ARROW-13740 [R] summarize() should not eagerly evaluate
-
- Resolved
-
- is related to
-
ARROW-14021 [R] Bind median() and quantile() to exact not approximate median and quantile
-
- Open
-
- links to