Details
- Type: New Feature
- Status: Closed
- Priority: Minor
- Resolution: Duplicate
Description
Add the ability to compute the mean and standard deviation of each vector (LabeledPoint) component and to normalize each vector in the RDD accordingly, using only RDD transformations. The result is an RDD of Vectors in which each column has a mean of zero and a standard deviation of one.
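The request above can be illustrated with a minimal plain-Python sketch of the map/reduce pattern it implies. This is not the Spark implementation: the function name `standardize` is hypothetical, ordinary lists stand in for the RDD, and the use of the population standard deviation and the mapping of constant columns to 0.0 are assumptions the description does not specify. In Spark, the two map steps would correspond to `RDD.map` and the combine step to `RDD.reduce`.

```python
import math
from functools import reduce


def standardize(rows):
    """Scale each column of `rows` to mean 0 and standard deviation 1.

    `rows` is a list of equal-length lists of floats (a stand-in for an RDD).
    """
    # Map step: each row becomes (count, per-column sums, per-column sums of squares).
    partials = map(lambda r: (1, list(r), [x * x for x in r]), rows)

    # Reduce step: merge two partial statistics element-wise.
    # The operation is associative and commutative, as a distributed reduce requires.
    def combine(a, b):
        return (
            a[0] + b[0],
            [x + y for x, y in zip(a[1], b[1])],
            [x + y for x, y in zip(a[2], b[2])],
        )

    n, sums, sq_sums = reduce(combine, partials)
    means = [s / n for s in sums]
    # Assumption: population standard deviation, sqrt(E[x^2] - mean^2).
    # max(..., 0.0) guards against small negative values from rounding.
    stds = [math.sqrt(max(q / n - m * m, 0.0)) for q, m in zip(sq_sums, means)]

    # Second map step: centre and scale every row.
    # Assumption: a constant column (std == 0) is mapped to 0.0.
    return [
        [(x - m) / s if s > 0 else 0.0 for x, m, s in zip(r, means, stds)]
        for r in rows
    ]
```

The sum-of-squares formula keeps the statistics to a single pass, but it can lose precision when the mean is large relative to the spread; a production version would likely prefer a numerically stabler update.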
Attachments
Issue Links
- duplicates SPARK-2272: Feature scaling which standardizes the range of independent variables or features of data. (Resolved)