Details
- New Feature
- Status: Resolved
- Minor
- Resolution: Won't Fix
- None
- None
Description
Mini-batch KMeans is a variant of KMeans that, in each iteration, updates the cluster centers using a randomly sampled subset of the data points instead of the full data set. This reduces the computation per iteration and, in some cases, also improves accuracy. The mini-batch version is compatible with the k-means|| initialization algorithm currently implemented in MLlib.
I suggest adding KMeans Mini-batch as an alternative.
I'd like this to be assigned to me.
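To make the proposal concrete, here is a minimal single-machine sketch of the mini-batch update in plain Python. It is not MLlib code and not the implementation proposed in this issue. The function names, the parameters (`batch_size`, `iterations`, `seed`) and the per-center learning rate of 1/count are all assumptions chosen for illustration, and the centers are seeded by plain random sampling rather than k-means||.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def closest_center(point, centers):
    """Index of the center nearest to `point` (ties go to the lowest index)."""
    return min(range(len(centers)),
               key=lambda j: squared_distance(point, centers[j]))


def mini_batch_kmeans(points, k, batch_size, iterations, seed=0):
    """Hypothetical helper: mini-batch k-means on a list of numeric tuples."""
    rng = random.Random(seed)
    # Assumption: seed the centers with k random data points
    # (a stand-in for k-means||, which is not reproduced here).
    centers = [list(p) for p in rng.sample(points, k)]
    counts = [0] * k  # number of points each center has absorbed so far

    for _ in range(iterations):
        # Draw a random subset instead of scanning the full data set.
        batch = rng.sample(points, min(batch_size, len(points)))
        # Assign every batch point against the centers as they stand
        # at the start of this iteration.
        assignments = [closest_center(p, centers) for p in batch]
        for p, j in zip(batch, assignments):
            counts[j] += 1
            eta = 1.0 / counts[j]  # per-center learning rate
            # Move the center toward the point: a running average.
            centers[j] = [(1 - eta) * c + eta * x
                          for c, x in zip(centers[j], p)]
    return centers
```

Each iteration touches only `batch_size` points, which is where the reduced per-iteration cost comes from. A distributed version would need to sample and aggregate the per-center updates across partitions, which this sketch does not attempt.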
Attachments
Issue Links
- is duplicated by
  - SPARK-14174 Implement the Mini-Batch KMeans (Resolved)
- links to