Uploaded image for project: 'Apache MADlib'
  1. Apache MADlib
  2. MADLIB-1203

k-NN Interface changes for classification and regression

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • k-NN
    • None

    Description

      k-NN has a single function for both classification and regression. To be consistent with other modules such as MLP and SVM, can we instead have two functions:

      knn_classification and knn_regression

      A couple of other usability changes:

      1) The first 7 parameters for the current knn implementation deal with providing some details about the training table and test table. Can we instead have two parameters, one for training table and the other for test table instead of those 7 params? We could have a comma separated key-value pair like in the optimization params for elastic net. 

      2) The output table currently has the `id` and `point` columns among others. The `point` column in redundant since the `id` is a unique identifier of a row. We could remove the `point` column from the output table.

      Attachments

        Activity

          People

            Unassigned Unassigned
            njayaram Nandish Jayaram
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: