Description
Add python support for the Power Iteration Clustering feature. Here is a fragment of the python API as we plan to implement it:
/**
- Java stub for Python mllib PowerIterationClustering.run()
*/
def trainPowerIterationClusteringModel(
data: JavaRDD[(java.lang.Long, java.lang.Long, java.lang.Double)],
k: Int,
maxIterations: Int,
runs: Int,
initializationMode: String,
seed: java.lang.Long): PowerIterationClusteringModel = {
val picAlg = new PowerIterationClustering()
.setK(k)
.setMaxIterations(maxIterations)
try
{ picAlg.run(data.rdd.persist(StorageLevel.MEMORY_AND_DISK)) }finally
{ data.rdd.unpersist(blocking = false) }}
Attachments
Issue Links
- duplicates
-
SPARK-5963 [MLLIB] Python support for Power Iteration Clustering
- Closed
- Is contained by
-
SPARK-7536 Audit MLlib Python API for 1.4
- Resolved
- is duplicated by
-
SPARK-6260 Python API for PowerIterationClustering
- Closed
- is related to
-
SPARK-7541 Check model save/load for MLlib 1.4
- Resolved
- relates to
-
SPARK-6254 MLlib Python API parity check at 1.3 release
- Closed
- links to