Spark / SPARK-13590

Document the behavior of spark.ml logistic regression and AFT survival regression when there are constant features

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 2.0.0
    • Fix Version/s: 2.0.0
    • Component/s: ML
    • Labels:
      None

      Description

      As discussed in SPARK-13029, we decided to keep the current behavior, which sets all coefficients associated with constant feature columns to zero regardless of the intercept, regularization, and standardization settings. This matches the behavior of glmnet. Because it differs from LIBSVM, we should document the behavior correctly, add tests, and generate warning messages if there are constant columns and `addIntercept` (`fitIntercept` in spark.ml) is false.

      cc Shuo Xiang DB Tsai
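
      The rule described above can be illustrated with a minimal, library-free sketch. This is not Spark's implementation: the function names `constant_feature_indices` and `zero_constant_coefficients` and the `fit_intercept` argument are hypothetical, chosen only for illustration, and the sketch applies the rule to an already-fitted coefficient list rather than inside an optimizer.

```python
import warnings


def constant_feature_indices(rows):
    """Return the indices of columns whose value is identical in every row."""
    if not rows:
        return []
    first = rows[0]
    return [
        j for j in range(len(first))
        if all(row[j] == first[j] for row in rows)
    ]


def zero_constant_coefficients(coefficients, rows, fit_intercept):
    """Set the coefficient of every constant column to 0.0.

    The zeroing happens regardless of fit_intercept; a warning is
    emitted only when constant columns exist and no intercept is fitted.
    """
    constant = set(constant_feature_indices(rows))
    if constant and not fit_intercept:
        warnings.warn(
            "Constant feature columns %s found while fitting without an "
            "intercept; their coefficients are set to zero."
            % sorted(constant)
        )
    return [0.0 if j in constant else c for j, c in enumerate(coefficients)]


# Hypothetical data: the middle column is constant (always 5.0).
rows = [[1.0, 5.0, 2.0], [3.0, 5.0, 4.0]]
print(constant_feature_indices(rows))  # → [1]
```

      Zeroing the coefficient whether or not an intercept is fitted mirrors the decision recorded in this issue; the warning covers the no-intercept case, where a user coming from LIBSVM might expect the constant column to act as a bias term.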

              People

              • Assignee:
                yanboliang Yanbo Liang
                Reporter:
                mengxr Xiangrui Meng