Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-17133 Improvements to linear methods in Spark
  3. SPARK-17151

Decide how to handle inferring number of classes in Multinomial logistic regression

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Minor
    • Resolution: Incomplete
    • None
    • None
    • ML, MLlib

    Description

      This JIRA is to discuss how the number of label classes should be inferred in multinomial logistic regression. Currently, MLOR checks the dataframe metadata and if the number of classes is not specified then it uses the maximum value seen in the label column. If the labels are not properly indexed, then this can cause a large number of zero coefficients and potentially produce instabilities in model training.

      Attachments

        Activity

          People

            Unassigned Unassigned
            sethah Seth Hendrickson
            Votes:
            1 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: