Description
This issue tracks Kai Jiang's Google Summer of Code 2016 project, "Apache Spark: Exposing more R and Python APIs for MLlib".
See the attached proposal for details. Note that the tasks listed in the proposal are tentative and may change as the community works on these parts of MLlib.
This umbrella issue will link to the tasks included in this project; each link is added when its task begins.
Attachments
Issue Links
- contains
  - SPARK-15672 R programming guide update (Resolved)
  - SPARK-14373 PySpark ml RandomForestClassifier, Regressor support export/import (Resolved)
  - SPARK-16137 Random Forest wrapper in SparkR (Resolved)
  - SPARK-11938 Expose numFeatures in all ML PredictionModel for PySpark (Resolved)
  - SPARK-15767 Decision Tree Regression wrapper in SparkR (Resolved)
  - SPARK-15490 SparkR 2.0 QA: New R APIs and API docs for non-MLib changes (Resolved)
- is related to
  - SPARK-13489 GSoC 2016 project ideas for MLlib (Closed)
- relates to
  - SPARK-15439 Failed to run unit test in SparkR (Resolved)
  - SPARK-14978 PySpark TrainValidationSplitModel should support validationMetrics (Resolved)