Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-1406

PMML model evaluation support via MLib

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.0
    • Component/s: MLlib
    • Labels:
      None
    • Target Version/s:

      Description

      It would be useful if spark would provide support the evaluation of PMML models (http://www.dmg.org/v4-2/GeneralStructure.html).

      This would allow to use analytical models that were created with a statistical modeling tool like R, SAS, SPSS, etc. with Spark (MLib) which would perform the actual model evaluation for a given input tuple. The PMML model would then just contain the "parameterization" of an analytical model.

      Other projects like JPMML-Evaluator do a similar thing.
      https://github.com/jpmml/jpmml/tree/master/pmml-evaluator

        Attachments

        1. kmeans.xml
          2 kB
          Vincenzo Selvaggio
        2. SPARK-1406.pdf
          187 kB
          Vincenzo Selvaggio
        3. MyJPMMLEval.java
          2 kB
          Vincenzo Selvaggio
        4. SPARK-1406_v2.pdf
          199 kB
          Vincenzo Selvaggio

          Issue Links

            Activity

              People

              • Assignee:
                selvinsource Vincenzo Selvaggio
                Reporter:
                thomasd Thomas Darimont
              • Votes:
                7 Vote for this issue
                Watchers:
                29 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: