Details
Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
None
None
Description
Similar to `stats.py` in Python, we should add support for mean, median, stdev, etc. More specifically, the functions we should support include:
1. sum(rdd)
2. histogram(rdd, buckets)
3. mean(rdd)
4. variance(rdd)
5. stdev(rdd)
6. sampleStdev(rdd)
7. sampleVariance(rdd)
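
To pin down the intended semantics of the functions listed above, here is a minimal sketch in plain Python. It is not the proposed implementation: a plain list stands in for the RDD, the snake_case names are illustrative only, and the convention that `buckets` is a sorted list of boundaries (with the last bucket closed on the right) is an assumption, since the ticket does not define it. The sketch also assumes that `variance`/`stdev` are the population forms (divide by n) and that `sampleVariance`/`sampleStdev` divide by n - 1.

```python
import math
from bisect import bisect_right


def rdd_sum(values):
    # Stand-in for sum(rdd); fsum limits floating-point rounding error.
    return math.fsum(values)


def mean(values):
    # Raises ZeroDivisionError on empty input.
    values = list(values)
    return math.fsum(values) / len(values)


def variance(values):
    # Population variance: squared deviations divided by n (assumed).
    values = list(values)
    m = mean(values)
    return math.fsum((x - m) ** 2 for x in values) / len(values)


def sample_variance(values):
    # Sample variance: squared deviations divided by n - 1 (assumed).
    values = list(values)
    m = mean(values)
    return math.fsum((x - m) ** 2 for x in values) / (len(values) - 1)


def stdev(values):
    return math.sqrt(variance(values))


def sample_stdev(values):
    return math.sqrt(sample_variance(values))


def histogram(values, buckets):
    # Assumed convention: `buckets` is a sorted list of boundaries
    # [b0, b1, ..., bn] defining [b0, b1), ..., [b(n-1), bn].
    # The last bucket also includes bn; out-of-range values are skipped.
    counts = [0] * (len(buckets) - 1)
    for x in values:
        if x < buckets[0] or x > buckets[-1]:
            continue
        i = min(bisect_right(buckets, x) - 1, len(counts) - 1)
        counts[i] += 1
    return counts


data = [2, 4, 4, 4, 5, 5, 7, 9]
print(rdd_sum(data), mean(data), variance(data), stdev(data))
print(sample_variance(data), sample_stdev(data))
print(histogram(data, [0, 5, 10]))
```

A distributed version would compute the count, mean, and sum of squared deviations per partition and merge them, rather than materializing the values as this sketch does.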