Details
Description
The broadcasted centers in KMeans are never un-persisted. As a result, memory usage accumulates with usage causing a memory leak.
Attachments
Issue Links
- relates to
-
SPARK-10329 Cost RDD in k-means|| initialization is not storage-efficient
- Resolved
-
SPARK-10354 First cost RDD shouldn't be cached in k-means|| and the following cost RDD should use MEMORY_AND_DISK
- Resolved
-
SPARK-11560 Optimize KMeans implementation / remove 'runs' from implementation
- Resolved
- links to