Spark / SPARK-6841

Similar to `stats.py` in Python, add support for mean, median, stdev etc.


Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.0
    • Component/s: SparkR
    • Labels: None

    Description

      Similar to `stats.py` in Python, we should add support for mean, median, stdev, etc. More specifically, the functions we should support include:
      1. sum(rdd)
      2. histogram(rdd, buckets)
      3. mean(rdd)
      4. variance(rdd)
      5. stdev(rdd)
      6. sampleStdev(rdd)
      7. sampleVariance(rdd)
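
      The intended semantics of the seven functions can be sketched in plain Python on an in-memory list. This is only an illustration, not the SparkR implementation: the function names below are hypothetical, and two points are assumptions borrowed from PySpark's behavior. First, `variance`/`stdev` are taken to be the population statistics (divide by n) and `sample_variance`/`sample_stdev` the sample statistics (divide by n - 1). Second, `histogram` is taken to accept a sorted list of bucket boundaries, with every bucket half-open except the last, which is closed.

```python
import math
from bisect import bisect_right


def rdd_sum(xs):
    """Sum of all elements (fsum limits floating-point error)."""
    return math.fsum(xs)


def mean(xs):
    """Arithmetic mean of a non-empty list."""
    return rdd_sum(xs) / len(xs)


def variance(xs):
    """Population variance: squared deviations divided by n."""
    m = mean(xs)
    return math.fsum((x - m) ** 2 for x in xs) / len(xs)


def sample_variance(xs):
    """Sample variance: squared deviations divided by n - 1 (needs n >= 2)."""
    m = mean(xs)
    return math.fsum((x - m) ** 2 for x in xs) / (len(xs) - 1)


def stdev(xs):
    """Population standard deviation."""
    return math.sqrt(variance(xs))


def sample_stdev(xs):
    """Sample standard deviation."""
    return math.sqrt(sample_variance(xs))


def histogram(xs, buckets):
    """Count elements per bucket.

    `buckets` is a sorted list of boundaries [b0, b1, ..., bk]. Bucket i
    covers [b_i, b_{i+1}), except the last bucket, which also includes
    its upper boundary. Values outside [b0, bk] are ignored.
    """
    counts = [0] * (len(buckets) - 1)
    for x in xs:
        if x < buckets[0] or x > buckets[-1]:
            continue
        # bisect_right finds the first boundary greater than x; clamp so
        # that x == bk lands in the last bucket.
        i = min(bisect_right(buckets, x) - 1, len(counts) - 1)
        counts[i] += 1
    return counts


data = [1.0, 2.0, 3.0, 4.0, 5.0]
print(rdd_sum(data), mean(data), variance(data), sample_variance(data))
print(histogram(data, [1, 3, 5]))
```

      In a distributed setting these statistics would normally be computed from per-partition partial results (count, mean, and sum of squared deviations) that are then merged, rather than from a single in-memory list as in this sketch.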

          People

            Assignee: Qian Huang (qhuang)
            Reporter: Shivaram Venkataraman (shivaram)
            Votes: 0
            Watchers: 4

            Dates

              Created:
              Updated:
              Resolved: