Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-17133 Improvements to linear methods in Spark
  3. SPARK-21245

Resolve code duplication for classification/regression summarizers

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Minor
    • Resolution: Incomplete
    • 2.2.2
    • None
    • ML

    Description

      In several places (LogReg, LinReg, SVC) in Spark ML, we collect summary information about training data using MultivariateOnlineSummarizer and MulticlassSummarizer. We have the same code appearing in several places (and including test suites). We can eliminate this by creating a common implementation somewhere.

      Attachments

        Activity

          People

            Unassigned Unassigned
            sethah Seth Hendrickson
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: