Spark / SPARK-7651

PySpark GMM predict, predictSoft should fail on bad input

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.3.0, 1.3.1, 1.4.0
    • Fix Version/s: 1.3.2, 1.4.0
    • Component/s: MLlib, PySpark
    • Labels:
      None

      Description

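      In PySpark, GaussianMixtureModel.predict and GaussianMixtureModel.predictSoft check whether the argument is an RDD and operate correctly if it is. If the argument is not an RDD, they fail silently: no branch handles that case, so the call returns nothing (None) instead of reporting the bad input.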

      https://github.com/apache/spark/blob/11a1a135d1fe892cd48a9116acc7554846aed84c/python/pyspark/mllib/clustering.py#L176

      Instead, they should raise errors.
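
      The requested behavior can be sketched as follows. This is a minimal illustration, not the actual patch: FakeRDD and SketchModel are hypothetical stand-ins (so the snippet runs without Spark), the soft assignments are placeholder values rather than real GMM memberships, and the choice of TypeError and its message wording are assumptions.

```python
class FakeRDD:
    """Hypothetical stand-in for pyspark.RDD so the sketch runs without Spark."""

    def __init__(self, rows):
        self.rows = list(rows)

    def map(self, f):
        # Apply f to every element and wrap the result in a new FakeRDD.
        return FakeRDD(f(r) for r in self.rows)

    def collect(self):
        return list(self.rows)


class SketchModel:
    """Hypothetical model that mirrors only the argument check under discussion."""

    def predictSoft(self, x):
        if not isinstance(x, FakeRDD):
            # Fail loudly instead of falling through and returning None.
            raise TypeError(
                "x should be represented by an RDD, but got %s" % type(x).__name__
            )
        # Placeholder memberships for two clusters (not a real GMM computation).
        return x.map(lambda v: [0.25, 0.75] if v > 0 else [0.75, 0.25])

    def predict(self, x):
        if not isinstance(x, FakeRDD):
            raise TypeError(
                "x should be represented by an RDD, but got %s" % type(x).__name__
            )
        # Hard label = index of the largest soft membership.
        return self.predictSoft(x).map(lambda z: z.index(max(z)))


model = SketchModel()
labels = model.predict(FakeRDD([-1.0, 2.0])).collect()

try:
    model.predict([1.0, 2.0])  # a plain list, not an RDD
    raised = False
except TypeError:
    raised = True
```

      With a check like this, passing a plain list raises immediately at the call site rather than producing a None that only causes a failure later in the caller's code.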

            People

            • Assignee:
              Meethu Mathew (MeethuMathew)
              Reporter:
              Joseph K. Bradley (josephkb)
            • Votes:
              0
              Watchers:
              3
