Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-14450

Python OneVsRest should train multiple models at once

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • None
    • None
    • ML, PySpark
    • None

    Description

      SPARK-7861 adds a Python wrapper for OneVsRest. Because of possible issues related to using existing libraries like multiprocessing, we are not training multiple models in parallel initially.

      This issue is for prototyping, testing, and implementing a way to train multiple models at once. Speaking with joshrosen, a good option might be the concurrent.futures package:

      We will not add this for Spark 2.0, but it will be good to investigate for 2.1.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              josephkb Joseph K. Bradley
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: