Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.5.0
    • Component/s: MLlib
    • Labels: None

      Description

      Summary: Create a class hierarchy for learning algorithms and the models those algorithms produce.

      This is a super-task of several sub-tasks (but JIRA does not allow subtasks of subtasks). See the "requires" links below for subtasks.

      Goals:

      • give intuitive structure to API, both for developers and for generated documentation
      • support meta-algorithms (e.g., boosting)
      • support generic functionality (e.g., evaluation; see the sketch after this list)
      • reduce code duplication across classes
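
      As a rough editorial illustration of the last two goals, here is a minimal sketch with simplified stand-in types (not the actual proposed Spark API): evaluation code written once against a shared model abstraction works for every classifier.

          // Illustrative stand-ins only, not the proposed Spark classes.
          trait ClassificationModel {
            def predict(features: Array[Double]): Double
          }

          object Evaluation {
            // Written once against the abstraction, so it works for any
            // classifier's model: this genericity is the point of the hierarchy.
            def accuracy(model: ClassificationModel,
                         data: Seq[(Array[Double], Double)]): Double = {
              val correct = data.count { case (features, label) =>
                model.predict(features) == label
              }
              correct.toDouble / data.size
            }
          }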

      Design doc for class hierarchy

        Issue Links

          Activity

          josephkb Joseph K. Bradley added a comment -

          I'm closing this and calling it fixed. This JIRA was scoped a bit too large, and I'm cutting it off to limit it to classification and regression abstractions. I've created a new umbrella SPARK-10817 under which we can do further work on abstractions.

          avulanov Alexander Ulanov added a comment -

          Joseph K. Bradley Hi, Joseph! Do you plan to add support for multivariate regression? I need this for the multi-layer perceptron, and a multivariate regression interface might be useful for other tasks as well. I've added an issue: https://issues.apache.org/jira/browse/SPARK-9120. I also wonder whether you plan to add integer array parameters: https://issues.apache.org/jira/browse/SPARK-9118. Both seem to be relatively easy to implement; the question is whether you plan to merge these features in the near future.

          josephkb Joseph K. Bradley added a comment -

          I would call it a sub-task, but we have not yet addressed it. Spark 1.4 won't include clustering in the Pipelines API, I'm afraid, but I'd like to get started adding it ASAP so it can hopefully be in 1.5. I'll make a JIRA and link it here.

          rajao Jao Rabary added a comment - edited

          Are unsupervised learning algorithms also covered by this standardization? I would like to use algorithms such as k-means with ml pipelines. How can one get started with that?

          josephkb Joseph K. Bradley added a comment -

          That's supported via the "rawPredictions" output column in the org.apache.spark.ml.classification.Classifier abstraction. As we add wrappers for spark.mllib algorithms in the spark.ml package, classifiers that support confidences can output this column. Classifiers which don't yet support confidences can be subclasses of Predictor instead of Classifier. But yes, it will be important to support more and more!
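
          (Editor's sketch, with simplified stand-in types rather than the actual spark.ml signatures, of how generic code could consume such per-class raw scores:)

          // Simplified stand-ins, not the real spark.ml signatures.
          trait ClassifierModel {
            def predict(features: Array[Double]): Double
          }

          // Models that can produce one raw score per class mix in this trait;
          // the predicted class is the argmax over the raw scores.
          trait HasRawPrediction extends ClassifierModel {
            def predictRaw(features: Array[Double]): Array[Double]
            override def predict(features: Array[Double]): Double =
              predictRaw(features).zipWithIndex.maxBy(_._1)._2.toDouble
          }

          object Confidence {
            // Generic code can ask any confidence-capable model for its top score.
            def topScore(model: HasRawPrediction, features: Array[Double]): Double =
              model.predictRaw(features).max
          }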

          derenrich Daniel Erenrich added a comment -

          One thing I'm interested in along these lines is a standard interface/method for getting confidence scores from models. Currently I cannot write code that generically accepts a model that can give me the probability that its prediction is correct. There are many use cases where you would want to handle such models, but there doesn't appear to be a standard way to get that information, even though many of the models already support this functionality in one way or another.

          josephkb Joseph K. Bradley added a comment -

          Using Vector types is better since they store values as Array[Double], which avoids creating an object for every value. If you're thinking about feature names/metadata, the Metadata capability in DataFrame will be able to handle metadata for each feature in Vector columns.
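
          (Editor's sketch of the boxing point, using a simplified stand-in rather than Spark's actual Vector classes:)

          // Simplified stand-in, not Spark's Vector hierarchy. Array[Double]
          // stores primitives contiguously, with no per-element object.
          object VectorSketch {
            final class DenseVector(val values: Array[Double]) {
              def apply(i: Int): Double = values(i) // unboxed primitive read
              def size: Int = values.length
            }

            val v = new DenseVector(Array(1.0, 0.0, 3.5))
            // By contrast, List[Double](1.0, 0.0, 3.5) boxes each element
            // as a java.lang.Double object.
          }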

          prudenko Peter Rudenko added a comment - edited

          For tree-based algorithms, I'm curious whether there would be a performance benefit (assuming a reimplementation of decision trees) from passing DataFrame columns directly rather than a single column with a vector type. E.g.:

          class GBT extends Estimator with HasInputCols

          val model = new GBT().setInputCols("col1", "col2", "col3", ...)


          and split the dataset using the DataFrame API.

          josephkb Joseph K. Bradley added a comment -

          I'm canceling my WIP PR for this since I have begun breaking that PR into smaller PRs.
          The WIP PR branch is in my ml-api branch.

          Here's the description of the WIP PR:

          This is a WIP effort to standardize abstractions and the developer API for prediction tasks (classification and regression) in the new ML api (org.apache.spark.ml).

          • Please comment on:
            • abstractions, class hierarchy
            • functionality required by each abstraction
            • naming of types and methods
            • ease of use for developers
            • ease of use for users migrating from org.apache.spark.mllib
          • Please ignore for now:
            • missing tests and examples
            • private/public API (I will make more things private to ml after writing tests and examples.)
            • style and other details
            • the many TODO items noted in the code

          Please refer to https://issues.apache.org/jira/browse/SPARK-3702 for some discussion on design, and this design doc for major design decisions.

          This is not intended to cover all algorithms; e.g., one big missing item is porting the GeneralizedLinearModel class to the new API. But it hopefully lays a fair amount of groundwork.

          I have included a limited number of concrete classes in this WIP PR, for purposes of illustration (a simplified sketch of the hierarchy's shape follows this list):

          • LogisticRegression (edited, to show effects of abstract classes)
          • NaiveBayes (simple to show ease of use for developers)
          • AdaBoost (demonstration of meta-algorithms taking advantage of abstractions)
            • (Note discussion of strong vs. weak types for ensemble methods in the design doc.)
            • This implementation is very incomplete but illustrates using the abstractions.
          • LinearRegression (example of Regressor, for completeness)
          • evaluators (to provide default evaluators in the class hierarchy)
          • IterativeSolver and IterativeEstimator (to expose iterative algorithms)
          • LabeledPoint (Q: Should this include an instance weight?)
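
          To make the hierarchy's shape concrete, a much-simplified editorial sketch (stand-in names and signatures; the design doc has the actual proposal):

          // Simplified, illustrative stand-ins; see the design doc for the real API.
          case class LabeledPoint(label: Double, features: Array[Double])

          abstract class Model {
            def predict(features: Array[Double]): Double
          }

          abstract class Estimator[M <: Model] {
            def fit(data: Seq[LabeledPoint]): M
          }

          // Predictor specializes Estimator for supervised prediction;
          // Classifier and Regressor specialize it further by label type.
          abstract class Predictor[M <: Model] extends Estimator[M]
          abstract class Classifier[M <: Model] extends Predictor[M] {
            def numClasses: Int
          }
          abstract class Regressor[M <: Model] extends Predictor[M]

          // A meta-algorithm like AdaBoost can then be written once over any
          // base Classifier, which is one of the goals listed above.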

          Items remaining:

          • [ ] helper method for simulating a distribution over weighted instances by subsampling (for algorithms which do not support instance weights)
          • [ ] several TODO items noted in the code
          • [ ] add tests and examples
          • [ ] general cleanup
          • [ ] make more of hierarchy private to ml
          • [ ] split into several smaller PRs

          General plan for splitting into multiple PRs, in order:
          1. Simple class hierarchy
          2. Evaluators
          3. IterativeEstimator
          4. AdaBoost
          5. NaiveBayes (Any time after Evaluators)

          Thanks to @epahomov and @BigCrunsh for input, including from https://github.com/apache/spark/pull/2137, which improves upon the org.apache.spark.mllib APIs.

          josephkb Joseph K. Bradley added a comment -

          APIs for Classifiers, Regressors

          apachespark Apache Spark added a comment -

          User 'jkbradley' has created a pull request for this issue:
          https://github.com/apache/spark/pull/3427

          BigCrunsh Christoph Sawade added a comment -

          Okay. I will follow it.

          josephkb Joseph K. Bradley added a comment -

          Thanks for taking a close look!

          • Abstraction of Multilabel
            Things definitely get more complex with multiple labels, and the best way to handle them is not clear to me. I agree it would not make sense to have a whole bunch of types for the different combinations of multiple labels. Perhaps the abstraction should be MultilabelEstimator, which could predict any combination of categories and/or real values.
            • Note: It should not be a list of Estimators since proper multilabel prediction would do joint prediction, rather than predicting each label separately.
          • Model-based vs. memory-based
            Would these two concepts affect the public API? I don't think they would, but do you have an example of why there should be a shared abstract class?
            • For k-nearest-neighbors, I think the same Classifier and Classifier.Model abstraction would work. The Classifier would ideally compute some nice data structure for finding nearest neighbors, and the Model would store that data structure (or the original dataset for a very basic implementation).
          • Model vs. Estimator Abstraction
            I think you're bringing up an important point about public vs. developer interfaces. Here's what I mean:
            • Public interfaces: For most users, the functionality is the most important aspect. E.g., most users need to know they are using a Classifier, regardless of whether it is a DecisionTree or a GLM.
            • Developer (private[mllib]) interfaces: For developers, abstractions such as DecisionTree and GLM are very important.
            • Proposal: As part of the "Standardize MLlib interfaces," I hope to first clarify the public interfaces and decide what interfaces need to be exposed. As needed, we can work on improving the developer interfaces for specific groups of algorithms.
              • For this, the JIRA on clarifying GLM interfaces (https://issues.apache.org/jira/browse/SPARK-3251) seems like an important one, but it may be blocked by updates to the public MLlib API.

          Does that sound reasonable?

          With respect to traits vs. abstract classes, I agree it may be good to keep the lightweight public interfaces defined as traits as much as possible.
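
          (A tiny editorial sketch of that trait-based option, with illustrative names only:)

          // Illustrative only: public model contracts as lightweight traits, so a
          // concrete model can mix several in without fighting single inheritance.
          trait Model {
            def predict(features: Array[Double]): Double
          }
          trait ClassifierModel extends Model {
            def numClasses: Int
          }
          trait ProbabilisticClassifierModel extends ClassifierModel {
            def predictProbabilities(features: Array[Double]): Array[Double]
          }
          // A concrete LogisticRegressionModel could then extend an internal
          // linear-model base class and mix in ProbabilisticClassifierModel.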

          Almost done with initial prototype code, and will post that soon.

          BigCrunsh Christoph Sawade added a comment - edited

          Great initiative. I really appreciate the attempt to standardize and identify common interfaces. Currently, I have three issues:

          • Abstraction of Multilabel
            The distinction between classification and regression seems natural, and the multi-label abstraction also makes sense to me. The simplest multi-label approach that I can think of is a collection of binary classifiers. Do you also plan to support mixtures of multi-labels (regression / multinomial classification)? If so, does it make sense to distinguish between ``MultilabelClassifier`` and ``MultilabelRegressor``? Isn't it then just a list of Estimators?
          • Model-based vs. memory-based
            I am wondering whether it is worth distinguishing between memory-based (e.g., k-nearest neighbours, kernel machines, ...) and model-based predictions (decision trees, NN, Naive Bayes, GLMs). Or, more generally, how does k-nearest neighbours fit into that framework?
          • Model vs. Estimator Abstraction
            Currently, the main distinction is between classification and regression. However, many methods are similar because they share the same modelling rather than the same prediction type. I am wondering how these functional similarities can be reflected in the hierarchy. I tried to follow a bottom-up approach and applied these abstractions to different learning methods. Here are two examples:

          Decision trees are trained with some recursive algorithm such as ID3 or C4.5, and the prediction is obtained by traversing the tree. The difference between classification and regression plays a rather minor role. So, intuitively, there is a DecisionTree estimator that can be, e.g., ID3 or C4.5. Then, the DecisionTreeClassifier is a DecisionTree with classification criteria; it returns a DecisionTree.Model (the tree) with a predictClass function (Classifier.Model?). The DecisionTreeRegressor is a DecisionTree with regression criteria, and it returns a DecisionTree.Model with a predictScore function (Regressor.Model?). Formally, it looks like

          • DecisionTree extends Estimator
          • DecisionTreeClassifier extends DecisionTree with Classifier
          • DecisionTreeRegressor extends DecisionTree with Regressor
          • DecisionTree.Model extends Transformer
          • DecisionTreeClassifier.Model extends DecisionTree.Model with Classifier.Model
          • DecisionTreeRegressor.Model extends DecisionTree.Model with Regressor.Model

          Methods like LogReg, SVM, RidgeRegression, ... maintain a weight vector (one could probably summarize them as GLMs). The inner product with the example vector naturally yields a regression score for each prediction; a binary classification is then derived by thresholding that score. The underlying optimization problem for all of them consists of a sum over loss functions plus a regularization term (regularized empirical risk minimization; written out after the list below), which can be solved by different solvers, e.g., SGD, LBFGS... So to exploit this structure, I would expect something like this:

          • RegularizedEmpiricalRiskMinimizer extends Estimator
            // LogisticRegression and SupportVectorMachine could be an automatic selection between the binomial and multinomial version
          • BinomialLogisticRegression extends RegularizedEmpiricalRiskMinimizer
          • MultinomialLogisticRegression extends RegularizedEmpiricalRiskMinimizer
          • BinomialSupportVectorMachine extends RegularizedEmpiricalRiskMinimizer
          • RidgeRegression extends RegularizedEmpiricalRiskMinimizer
          • LinearModel extends Transformer
          • BinomialLinearModel extends LinearModel with Classifier.Model
          • MultinomialLinearModel extends LinearModel with Classifier.Model
          • BinomialLogisticRegression.Model extends BinomialLinearModel with ProbabilisticClassificationModel
          • MultinomialLogisticRegression.Model extends MultinomialLinearModel with ProbabilisticClassificationModel
          • BinomialSupportVectorMachine.Model extends BinomialLinearModel // actually it is a binomial linear model
          • RidgeRegression.Model extends LinearModel // actually it is a linear model
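
          For reference, the shared objective described above (regularized empirical risk minimization) can be written in standard notation as

              \min_w \; \frac{1}{n} \sum_{i=1}^{n} \ell(w; x_i, y_i) + \lambda R(w)

          where the choice of loss \ell selects the method (logistic loss for LogReg, hinge loss for SVM, squared loss for RidgeRegression), R is a regularizer such as the L2 penalty, and different solvers such as SGD or LBFGS optimize the same objective.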

          So isn't Classifier.Model more a trait than an abstract class? Perhaps I just missed something, but I think it is helpful to consider the interfaces for specific instances. I am really interested in discussing the pros/cons.

          josephkb Joseph K. Bradley added a comment -

          SPARK-3251 discusses a subset of the class hierarchy discussed here (for regression).

          josephkb Joseph K. Bradley added a comment -

          Both JIRAs discuss class hierarchy. This JIRA covers more classes. SPARK-3507 covers other issues such as testing.

          josephkb Joseph K. Bradley added a comment -

          The design doc is only partly written, and I am still writing up prototypes for Xiangrui Meng's github repo with prototypes of the new API. Feedback welcome, especially since this will hopefully cover a lot of learning settings!


            People

            • Assignee: josephkb Joseph K. Bradley
            • Reporter: josephkb Joseph K. Bradley
            • Votes: 4
            • Watchers: 21
