Description
In Spark 1.6, MLlib provides logistic regression and linear regression with L1/L2/elastic-net regularization. We want to expand the support of generalized linear models (GLMs) in 2.0, e.g., Poisson/Gamma families and more link functions. SPARK-9835 implements a GLM solver for the case when the number of features is small. We also need to design an interface for GLMs.
In SparkR, we can simply follow glm or glmnet. On the Python/Scala/Java side, the interface should be consistent with LinearRegression and LogisticRegression, e.g.,
val glm = new GeneralizedLinearModel() .setFamily("poisson") .setSolver("irls")
It would be great if LinearRegression and LogisticRegression can reuse code from GeneralizedLinearModel.
Attachments
Issue Links
- is depended upon by
-
SPARK-13597 Python API for GeneralizedLinearRegression
- Resolved
- is related to
-
SPARK-12566 GLM model family, link function support in SparkR:::glm
- Resolved
- relates to
-
SPARK-9835 Iteratively reweighted least squares solver for GLMs
- Resolved
- links to