Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-7651

PySpark GMM predict, predictSoft should fail on bad input

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 1.3.0, 1.3.1, 1.4.0
    • 1.3.2, 1.4.0
    • MLlib, PySpark
    • None

    Description

      In PySpark, GaussianMixtureModel predict and predictSoft test if the argument is an RDD and operate correctly if so. But if the argument is not an RDD, they fail silently, returning nothing.

      https://github.com/apache/spark/blob/11a1a135d1fe892cd48a9116acc7554846aed84c/python/pyspark/mllib/clustering.py#L176

      Instead, they should raise errors.

      Attachments

        Activity

          People

            MeethuMathew Meethu Mathew
            josephkb Joseph K. Bradley
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: