SPARK-31681

Python multiclass logistic regression evaluate should return LogisticRegressionSummary

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.0.0
    • Fix Version/s: 3.0.0
    • Component/s: ML, PySpark
    • Labels:
    • Docs Text: In Spark 3.0, evaluating a multiclass logistic regression model in PySpark now (correctly) returns LogisticRegressionSummary, not the subclass BinaryLogisticRegressionSummary. The additional methods exposed by BinaryLogisticRegressionSummary would not work in this case anyway.

      Description

          def evaluate(self, dataset):
              ...
              java_blr_summary = self._call_java("evaluate", dataset)
              # Always wraps the result as a binary summary, even for multiclass models
              return BinaryLogisticRegressionSummary(java_blr_summary)
      

      For a multiclass LogisticRegression model, evaluate should return LogisticRegressionSummary instead of BinaryLogisticRegressionSummary.
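
The intended behavior could be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual patch: the two classes are stand-ins for the PySpark summary wrappers, `wrap_summary` is a hypothetical helper, and using the model's class count as the deciding condition is an assumption.

```python
class LogisticRegressionSummary:
    """Stand-in for the general summary wrapper (not the real PySpark class)."""

    def __init__(self, java_obj):
        self._java_obj = java_obj


class BinaryLogisticRegressionSummary(LogisticRegressionSummary):
    """Stand-in for the binary-only subclass, which adds binary-specific metrics."""


def wrap_summary(java_summary, num_classes):
    """Hypothetical helper: choose the wrapper from the number of classes.

    Assumption: a model with at most two classes is treated as binary;
    anything larger gets the general (multiclass-safe) summary.
    """
    if num_classes <= 2:
        return BinaryLogisticRegressionSummary(java_summary)
    return LogisticRegressionSummary(java_summary)


# A three-class model gets the general summary, not the binary subclass.
multiclass_summary = wrap_summary(object(), num_classes=3)
binary_summary = wrap_summary(object(), num_classes=2)
```

With this shape, callers of a multiclass model no longer receive an object that advertises binary-only methods that cannot work.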

            People

            • Assignee: Huaxin Gao (huaxingao)
            • Reporter: Huaxin Gao (huaxingao)
