Spark / SPARK-31681

Python multiclass logistic regression evaluate should return LogisticRegressionSummary


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.0.0
    • Fix Version/s: 3.0.0
    • Component/s: ML, PySpark
    • In Spark 3.0, calling evaluate on a multiclass logistic regression model in PySpark now (correctly) returns LogisticRegressionSummary, not the subclass BinaryLogisticRegressionSummary. The additional methods exposed by BinaryLogisticRegressionSummary would not work in this case anyway.

    Description

          def evaluate(self, dataset):
              ......
              java_blr_summary = self._call_java("evaluate", dataset)
              return BinaryLogisticRegressionSummary(java_blr_summary)
      

      evaluate currently always wraps the result in BinaryLogisticRegressionSummary. For multiclass LogisticRegression, we should return LogisticRegressionSummary instead.
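
      A minimal sketch of one way the change could look. The classes below are stand-ins so the example runs without a Spark installation: their names mirror pyspark.ml.classification, but LogisticRegressionModelSketch, its _call_java placeholder, and the choice to branch on numClasses are assumptions made for illustration, not necessarily the merged patch.

```python
# Stand-in classes so the sketch runs without Spark; the bodies are hypothetical.
class LogisticRegressionSummary:
    def __init__(self, java_obj):
        self._java_obj = java_obj


class BinaryLogisticRegressionSummary(LogisticRegressionSummary):
    """In the real library this subclass adds binary-only metrics."""


class LogisticRegressionModelSketch:
    """Hypothetical stand-in for the Python model wrapper."""

    def __init__(self, num_classes):
        self.numClasses = num_classes

    def _call_java(self, name, *args):
        # Placeholder for the JVM call made by the real wrapper.
        return object()

    def evaluate(self, dataset):
        java_summary = self._call_java("evaluate", dataset)
        # Assumption: only a model with at most two classes gets the binary subclass.
        if self.numClasses <= 2:
            return BinaryLogisticRegressionSummary(java_summary)
        return LogisticRegressionSummary(java_summary)
```

      With this shape, a three-class model yields a plain LogisticRegressionSummary, while a two-class model still yields BinaryLogisticRegressionSummary.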


          People

            Assignee: Huaxin Gao (huaxingao)
            Reporter: Huaxin Gao (huaxingao)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated:
              Resolved:
