SPARK-31681

Python multiclass logistic regression evaluate should return LogisticRegressionSummary

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.0.0
    • Fix Version/s: 3.0.0
    • Component/s: ML, PySpark
    • Docs Text: In Spark 3.0, calling evaluate on a multiclass logistic regression model in PySpark now (correctly) returns LogisticRegressionSummary, not the subclass BinaryLogisticRegressionSummary. The additional methods exposed by BinaryLogisticRegressionSummary would not work in this case anyway.

    Description

          def evaluate(self, dataset):
              ......
              java_blr_summary = self._call_java("evaluate", dataset)
              return BinaryLogisticRegressionSummary(java_blr_summary)
      

      For a multiclass LogisticRegression model, evaluate should return LogisticRegressionSummary instead of BinaryLogisticRegressionSummary.
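
      The change requested here could be sketched as a dispatch on the number of classes. This is a minimal illustration, not the patch that was merged: the two summary classes and LogisticRegressionModelSketch below are hypothetical stand-ins so the snippet runs without Spark, and it assumes the model exposes a numClasses attribute.

```python
# Stand-in classes so the sketch runs without a Spark installation;
# they are NOT the real pyspark.ml.classification implementations.
class LogisticRegressionSummary:
    def __init__(self, java_obj):
        self._java_obj = java_obj


class BinaryLogisticRegressionSummary(LogisticRegressionSummary):
    """In the real API this subclass adds binary-only metrics."""


class LogisticRegressionModelSketch:
    """Hypothetical model stub holding only what evaluate() needs."""

    def __init__(self, num_classes):
        # Assumption: the model knows how many classes it was fit on.
        self.numClasses = num_classes

    def _call_java(self, name, *args):
        # Placeholder for the JVM call made by the real wrapper.
        return {"method": name, "args": args}

    def evaluate(self, dataset):
        java_summary = self._call_java("evaluate", dataset)
        # Only binary models get the binary-specific summary.
        if self.numClasses <= 2:
            return BinaryLogisticRegressionSummary(java_summary)
        return LogisticRegressionSummary(java_summary)


# "df" stands in for a DataFrame; the stub never inspects it.
binary_summary = LogisticRegressionModelSketch(2).evaluate("df")
multi_summary = LogisticRegressionModelSketch(3).evaluate("df")
```

      With this shape, callers that need binary-only metrics can check isinstance(summary, BinaryLogisticRegressionSummary) before using them.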

          People

            Assignee: Huaxin Gao (huaxingao)
            Reporter: Huaxin Gao (huaxingao)
            Votes: 0
            Watchers: 2
