Spark / SPARK-7409

Designing multilabel abstractions for spark.ml

Details

    • Type: Brainstorming
    • Status: Resolved
    • Priority: Major
    • Resolution: Incomplete
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: ML

    Description

      This JIRA is for discussing how to support multi-label prediction in the Pipelines API (spark.ml package). Some issues to figure out are:

      • Should there be dedicated multi-label abstractions?
        • How should they relate to the existing single-label abstractions: Predictor, Classifier, Regressor?
        • How much code sharing can the abstractions provide?
      • How should we support a mix of categorical and real-valued labels?
      • How do we support structure among the labels? There could be no known structure, a graphical structure, a chain structure, etc., depending on the application/model.
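
      To make the questions above concrete, here is a minimal sketch in plain Python (not Spark code) of one possible shape for such an abstraction. Every name in it (LabelSpec, majority_learner, mean_learner, fit_independent, fit_chain) is hypothetical and does not exist in spark.ml. It assumes each label is handled by an ordinary single-label learner chosen by label kind, which is one way a mix of categorical and real-valued labels could reuse existing Classifier/Regressor-style code. It shows only two of the structures mentioned: no structure (independent models) and a chain. The two learners are deliberately trivial stand-ins.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

# All names here are hypothetical illustrations, not spark.ml APIs.

@dataclass(frozen=True)
class LabelSpec:
    name: str
    kind: str  # "categorical" or "continuous"


Features = Sequence[float]
Model = Callable[[Features], float]
# A single-label learner: takes feature rows and one label column, returns a model.
Learner = Callable[[List[Features], List[float]], Model]


def majority_learner(X: List[Features], y: List[float]) -> Model:
    """Toy stand-in for a classifier: always predicts the most frequent label."""
    best = max(sorted(set(y)), key=y.count)
    return lambda features: best


def mean_learner(X: List[Features], y: List[float]) -> Model:
    """Toy stand-in for a regressor: always predicts the training mean."""
    mean = sum(y) / len(y)
    return lambda features: mean


def fit_independent(specs, learners: Dict[str, Learner], X, Y):
    """No label structure: fit one single-label model per label column."""
    models = []
    for j, spec in enumerate(specs):
        column = [row[j] for row in Y]
        models.append(learners[spec.kind](X, column))
    return lambda features: [m(features) for m in models]


def fit_chain(specs, learners: Dict[str, Learner], X, Y):
    """Chain structure: the model for label j also sees labels 0..j-1 as inputs."""
    models = []
    for j, spec in enumerate(specs):
        augmented = [list(x) + list(row[:j]) for x, row in zip(X, Y)]
        column = [row[j] for row in Y]
        models.append(learners[spec.kind](augmented, column))

    def predict(features):
        preds: List[float] = []
        for m in models:
            # Earlier predictions are appended to the features of later models.
            preds.append(m(list(features) + preds))
        return preds

    return predict


specs = [LabelSpec("is_spam", "categorical"), LabelSpec("score", "continuous")]
learners = {"categorical": majority_learner, "continuous": mean_learner}
X = [[0.0], [1.0], [2.0]]
Y = [[1.0, 0.5], [1.0, 1.5], [0.0, 1.0]]

independent_prediction = fit_independent(specs, learners, X, Y)([3.0])
chain_prediction = fit_chain(specs, learners, X, Y)([3.0])
print(independent_prediction, chain_prediction)
```

      Because the toy learners ignore their inputs, both structures give the same prediction here; with real learners the chain variant would differ, since later labels condition on earlier ones. In this sketch the only shared code is the per-label loop, which bears on the code-sharing question above. A graphical structure would need a richer description of label dependencies than an ordered list.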

            People

              Assignee: Unassigned
              Reporter: Joseph K. Bradley (josephkb)
              Votes: 2
              Watchers: 8

              Dates

                Created:
                Updated:
                Resolved: