SPARK-12811: Estimator interface for generalized linear models (GLMs)


Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 2.0.0
    • Fix Version/s: 2.0.0
    • Component/s: ML
    • Labels: None

    Description

      In Spark 1.6, MLlib provides logistic regression and linear regression with L1/L2/elastic-net regularization. We want to expand the support of generalized linear models (GLMs) in 2.0, e.g., Poisson/Gamma families and more link functions. SPARK-9835 implements a GLM solver for the case when the number of features is small. We also need to design an interface for GLMs.

      In SparkR, we can simply follow glm or glmnet. On the Python/Scala/Java side, the interface should be consistent with LinearRegression and LogisticRegression, e.g.,

      val glm = new GeneralizedLinearModel()
        .setFamily("poisson")
        .setSolver("irls")
      

      It would be great if LinearRegression and LogisticRegression could reuse code from GeneralizedLinearModel.
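
      To make the reuse idea concrete, here is a minimal, Spark-free sketch of how a GLM estimator could expose family and link as parameters, with linear and logistic regression as special cases. Every name in it (GlmSketch, Family, Link, LinearRegressionSketch, LogisticRegressionSketch, predictMean) is hypothetical and chosen for illustration. It is not the API proposed or shipped in Spark, and it only covers parameter handling and mean prediction, not fitting.

```scala
// Hypothetical sketch: none of these types are Spark classes.

// A link function g maps the mean mu to the linear predictor eta.
sealed trait Link {
  def link(mu: Double): Double     // eta = g(mu)
  def unlink(eta: Double): Double  // mu = g^{-1}(eta)
}
case object Identity extends Link {
  def link(mu: Double): Double = mu
  def unlink(eta: Double): Double = eta
}
case object Log extends Link {
  def link(mu: Double): Double = math.log(mu)
  def unlink(eta: Double): Double = math.exp(eta)
}
case object Logit extends Link {
  def link(mu: Double): Double = math.log(mu / (1.0 - mu))
  def unlink(eta: Double): Double = 1.0 / (1.0 + math.exp(-eta))
}
case object Inverse extends Link {
  def link(mu: Double): Double = 1.0 / mu
  def unlink(eta: Double): Double = 1.0 / eta
}

// Each family carries its canonical link, used when no link is set explicitly.
sealed abstract class Family(val name: String, val canonicalLink: Link)
case object Gaussian extends Family("gaussian", Identity)
case object Binomial extends Family("binomial", Logit)
case object Poisson extends Family("poisson", Log)
case object Gamma extends Family("gamma", Inverse)

object Family {
  def fromName(name: String): Family = name.toLowerCase match {
    case "gaussian" => Gaussian
    case "binomial" => Binomial
    case "poisson"  => Poisson
    case "gamma"    => Gamma
    case other      => throw new IllegalArgumentException(s"Unsupported family: $other")
  }
}

// Fluent setters in the style of the snippet in the description.
class GlmSketch {
  private var family: Family = Gaussian
  private var link: Option[Link] = None
  private var solver: String = "irls"

  def setFamily(name: String): this.type = { family = Family.fromName(name); this }
  def setLink(l: Link): this.type = { link = Some(l); this }
  def setSolver(name: String): this.type = { solver = name; this }

  def getFamily: Family = family
  def getLink: Link = link.getOrElse(family.canonicalLink)
  def getSolver: String = solver

  // Mean prediction for fixed coefficients: mu = g^{-1}(x . beta + intercept).
  def predictMean(features: Array[Double], coefficients: Array[Double], intercept: Double): Double = {
    require(features.length == coefficients.length, "dimension mismatch")
    val eta = features.zip(coefficients).map { case (x, b) => x * b }.sum + intercept
    getLink.unlink(eta)
  }
}

// Linear and logistic regression expressed as GLMs with a fixed family.
class LinearRegressionSketch extends GlmSketch { setFamily("gaussian") }
class LogisticRegressionSketch extends GlmSketch { setFamily("binomial") }

// Usage: a Poisson GLM defaults to the log link, so the mean is exp(eta).
val poissonGlm = new GlmSketch().setFamily("poisson").setSolver("irls")
val mu = poissonGlm.predictMean(Array(1.0, 2.0), Array(0.5, 0.25), 0.0)
```

      In this sketch the specialized estimators differ from the general one only in their default family, which is one way the shared prediction and parameter code could be reused.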

            People

              Assignee: Yanbo Liang (yanboliang)
              Reporter: Xiangrui Meng (mengxr)
              Shepherd: Xiangrui Meng
              Votes: 1
              Watchers: 5
