Description
Currently our UDAFs use the UserDefinedAggregateFunction class. There are two drawbacks with this:
1) It is less efficient than Aggregator
2) UserDefinedAggregateFunction is deprecated and removed from Spark 3.2.0.
This story is for changing them to use Aggregator.
The UDAFs are located here:
https://github.com/apache/datafu/blob/main/datafu-spark/src/main/scala/datafu/spark/SparkUDAFs.scala
Here are some links explaining how to do this:
This change should be backwards compatible if possible; the tests in TestSparkUDAFs should all still pass.