Uploaded image for project: 'Apache MADlib'
  1. Apache MADlib
  2. MADLIB-907

Prediction Metrics

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • v1.9.1
    • Module: Utilities
    • None

    Description

      Story

      As a data scientist, I want to compute prediction metrics on my data, so that I can gauge model accuracy based on predicted values vs. actual values.

      1) The PDL Tools modules "Prediction Metrics" [1] is an example of what could be ported to MADlib. Source code is located at [2].

      2) Here is functionality from PDL tools to use as a starting point:

      mf_mae
      Mean Absolute Error.

      mf_mape
      Mean Absolute Percentage Error.

      mf_mpe
      Mean Percentage Error.

      mf_rmse
      Root Mean Square Error.

      mf_r2
      R-squared.

      mf_adjusted_r2
      Adjusted R-squared.

      mf_binary_classifier
      Metrics for binary classification.

      mf_auc
      Area under the ROC curve (in binary classification).

      mf_confusion_matrix
      Confusion matrix for a multi-class classifier.

      References

      [1] PDL Tools Prediction Metrics module
      http://pivotalsoftware.github.io/PDLTools/group__grp__prediction__metrics.html
      [2] PDL tools source code
      https://github.com/pivotalsoftware/PDLTools

      Attachments

        1. MADlib_ Prediction_Metrics_user_doc_v2.pdf
          196 kB
          Frank McQuillan
        2. interface_v3.sql
          9 kB
          Orhan Kislal
        3. interface_v1.sql
          5 kB
          Orhan Kislal

        Issue Links

          Activity

            People

              okislal Orhan Kislal
              fmcquillan Frank McQuillan
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: