Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-13590

Document the behavior of spark.ml logistic regression and AFT survival regression when there are constant features

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 2.0.0
    • 2.0.0
    • ML
    • None

    Description

      As discussed in SPARK-13029, we decided to keep the current behavior that sets all coefficients associated with constant feature columns to zero, regardless of intercept, regularization, and standardization settings. This is the same behavior as in glmnet. Since this is different from LIBSVM, we should document the behavior correctly, add tests, and generate warning messages if there are constant columns and `addIntercept` is false.

      cc coderxiang dbtsai

      Attachments

        Issue Links

          Activity

            People

              yanboliang Yanbo Liang
              mengxr Xiangrui Meng
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: