Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-11476

Incorrect function referred to in MLib Random data generation documentation

    Details

      Description

      http://spark.apache.org/docs/latest/mllib-statistics.html in the "Random data generation", a comment in the example code says:

      Generate a random double RDD that contains 1 million i.i.d. values drawn from the standard normal distribution `N(0, 1)`, evenly distributed in 10 partitions.

      But it then calls normalRDD(), which does not do that - a call to uniformRDD() with the same parameters would do what the comment claims.

        Attachments

          Activity

            People

            • Assignee:
              srowen Sean Owen
              Reporter:
              jasonb Jason Blochowiak
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 10m
                10m
                Remaining:
                Remaining Estimate - 10m
                10m
                Logged:
                Time Spent - Not Specified
                Not Specified