Details
-
Umbrella
-
Status: Closed
-
Major
-
Resolution: Done
-
None
-
None
-
None
Description
Here is the todo list of SPARK-11604 found issues:
Note: I did not list the SparkR related features (such as ml.feature.Interaction). We have supported RFormula as a wrapper at Python side, I think we should discuss the necessary to support other R related features at Python side.
- Missing classes
- ml.attribute
SPARK-8516 - ml.feature
- QuantileDiscretizer
SPARK-11922 - ChiSqSelector
SPARK-11923
- QuantileDiscretizer
- ml.classification
- OneVsRest
SPARK-7861
- OneVsRest
- ml.clustering
- LDA
SPARK-11940
- LDA
- mllib.clustering
- BisectingKMeans
SPARK-11944
- BisectingKMeans
- ml.attribute
- Missing methods/parameters
SPARK-11938- ml.classification
SPARK-11815SPARK-11820 - ml.feature
SPARK-11925 - ml.clustering
SPARK-11945 - mllib.linalg
SPARK-12040SPARK-12041 - mllib.stat.test.StreamingTest
SPARK-12042
- ml.classification
- Docs:
- ml.classification
SPARK-11875
- ml.classification
Attachments
Issue Links
- is related to
-
SPARK-11604 ML 1.6 QA: API: Python API coverage
-
- Resolved
-
- relates to
-
SPARK-12810 PySpark CrossValidatorModel should support avgMetrics
-
- Resolved
-
1.
|
Python API for ml.feature.QuantileDiscretizer |
|
Resolved | Holden Karau |
2.
|
Update doc for PySpark HasCheckpointInterval |
|
Resolved | Yanbo Liang |
3.
|
PySpark LiR & LoR should support weightCol |
|
Resolved | Yanbo Liang |
4.
|
Add PySpark missing methods for ml.feature during Spark 1.6 QA |
|
Resolved | Yanbo Liang |
5.
|
Python API for ml.feature.ChiSqSelector |
|
Resolved | Xusen Yin |
6.
|
PySpark DecisionTreeClassifier & DecisionTreeRegressor should support setSeed |
|
Resolved | Yanbo Liang |
7.
|
Expose numFeatures in all ML PredictionModel for PySpark |
|
Resolved | Kai |
8.
|
Python API for ml.clustering.LDA |
|
Resolved | Jeff Zhang |
9.
|
Python API for mllib.clustering.BisectingKMeans |
|
Resolved | Holden Karau |
10.
|
Add computeCost to KMeansModel for PySpark spark.ml |
|
Resolved | Yanbo Liang |
11.
|
Add toJson/fromJson to Vector/Vectors for PySpark |
|
Closed | Unassigned |
12.
|
Add columnSimilarities to IndexedRowMatrix for PySpark |
|
Resolved | Kai Jiang |
13.
|
Feature parity for pyspark.mllib StandardScalerModel |
|
Resolved | Holden Karau |