Uploaded image for project: 'Apache MADlib'
  1. Apache MADlib
  2. MADLIB-927

Initial implementation of k-NN

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • v1.10
    • None

    Description

      k-Nearest Neighbors is a simple algorithm based on finding nearest neighbors of data points in a metric feature space according to a specified distance function. It is considered one of the canonical algorithms of data science. It is a nonparametric method, which makes it applicable to a lot of real-world problems where the data doesn’t satisfy particular distribution assumptions. It can also be implemented as a lazy algorithm, which means there is no training phase where information in the data is condensed into coefficients, but there is a costly testing phase where all data (or some subset) is used to make predictions.

      This JIRA involves implementing the naïve approach - i.e. compute the k nearest neighbors by going through all points.

      Attachments

        Issue Links

          Activity

            People

              hpandey12 Himanshu Pandey
              riyer Rahul Iyer
              Votes:
              1 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: