Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-9673

Use unbiased standard deviation in DataFrame.describe

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 1.5.0
    • Fix Version/s: None
    • Component/s: SQL
    • Labels:
      None
    • Target Version/s:

      Description

      It is common to compute unbiased standard deviation in statistics. Though is doesn't matter much for big data, it is nice to be consistent with existing statistics tools.

        Attachments

          Activity

            People

            • Assignee:
              brkyvz Burak Yavuz
              Reporter:
              mengxr Xiangrui Meng
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: