Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-29118

Avoid redundant computation in GMM.transform && GLR.transform

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 3.0.0
    • 3.0.0
    • ML
    • None

    Description

      In SPARK-27944, the computation for output columns with empty name is skipped.

      Now, I find that we can furthermore optimize

      1, GMM.transform by directly obtaining the prediction(double) from its probabilty prediction(vector), like what ProbabilisticClassificationModel and ClassificationModel do.

      2, GLR.transform by obtaining the prediction(double) from its link prediction(double)

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            podongfeng Ruifeng Zheng
            podongfeng Ruifeng Zheng
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment