Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
3.4.0
-
None
-
None
Description
`Groupby._reduce_for_stat_function` is a common helper function leveraged by multiple statistical functions of GroupBy objects.
It defines parameters `only_numeric` and `bool_as_numeric` to control accepted Spark types.
To be consistent with pandas API, we may also have to introduce `str_as_numeric` for `sum` for example.
Instead of introducing parameters designated for each Spark type, the PR is proposed to introduce a parameter `accepted_spark_types` to specify accepted types of Spark columns to be aggregated.