Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-1856 Standardize MLlib interfaces
  3. SPARK-5995

Make ML Prediction Developer APIs public

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.3.0
    • 1.4.0
    • ML
    • None

    Description

      Previously, some Developer APIs were added to spark.ml for classification and regression to make it easier to add new algorithms and models: SPARK-4789 There are ongoing discussions about the best design of the API. This JIRA is to continue that discussion and try to finalize those Developer APIs so that they can be made public.

      Please see this design doc from SPARK-4789 for details on the original API design.

      Some issues under debate:

      • Should there be strongly typed APIs for fit()?
        • Proposal: No
      • Should the strongly typed API for transform() be public (vs. protected)?
        • Proposal: Protected for now
      • What transformation methods should the API make developers implement for classification?
        • Proposal: See design doc
      • Should there be a way to transform a single Row (instead of only DataFrames)?
        • Proposal: Not for now

      Attachments

        Issue Links

          Activity

            People

              josephkb Joseph K. Bradley
              josephkb Joseph K. Bradley
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: