  1. Lucene - Core
  2. LUCENE-6654

KNearestNeighborClassifier does not take class ranking into consideration

Details

    • Patch Available

    Description

      Currently the KNN classifier assigns the score for a ClassificationResult based only on the frequency of the class in the top K results.

      This is a conceptual simplification: the ranking of the results should also play a part.

      Otherwise, the following can happen:

      Top 4
      1) Class1
      2) Class1
      3) Class2
      4) Class2

      With this top 4, both classes get the same score.
      The expected result, however, is that Class1 gets a better score, because MLT scores the documents accordingly and ranks the Class1 documents higher.
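
      The difference can be illustrated with a minimal sketch. This is not the code from the attached patch: the class KnnScoringSketch, the Neighbor type, both method names, and the neighbor scores (4.0, 2.0, 1.0, 1.0) are hypothetical and chosen only to mirror the Top 4 example above. It compares frequency-only scoring with one possible alternative, in which each neighbor contributes its normalized similarity score to its class.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class KnnScoringSketch {

    /** One of the top K neighbors: its class label and a hypothetical similarity score. */
    static final class Neighbor {
        final String label;
        final double score;

        Neighbor(String label, double score) {
            this.label = label;
            this.score = score;
        }
    }

    /** Frequency-only scoring: every neighbor counts equally, whatever its rank. */
    static Map<String, Double> frequencyScores(List<Neighbor> topK) {
        Map<String, Double> scores = new LinkedHashMap<>();
        for (Neighbor n : topK) {
            scores.merge(n.label, 1.0 / topK.size(), Double::sum);
        }
        return scores;
    }

    /** Score-weighted variant (an assumption): each neighbor contributes its share of the total score. */
    static Map<String, Double> scoreWeightedScores(List<Neighbor> topK) {
        double total = 0.0;
        for (Neighbor n : topK) {
            total += n.score;
        }
        Map<String, Double> scores = new LinkedHashMap<>();
        for (Neighbor n : topK) {
            scores.merge(n.label, n.score / total, Double::sum);
        }
        return scores;
    }

    public static void main(String[] args) {
        // Same ordering as the Top 4 example; the score values are made up.
        List<Neighbor> top4 = Arrays.asList(
                new Neighbor("Class1", 4.0),
                new Neighbor("Class1", 2.0),
                new Neighbor("Class2", 1.0),
                new Neighbor("Class2", 1.0));

        System.out.println(frequencyScores(top4));     // {Class1=0.5, Class2=0.5}
        System.out.println(scoreWeightedScores(top4)); // {Class1=0.75, Class2=0.25}
    }
}
```

      With frequency-only scoring the two classes tie, while the score-weighted variant favors Class1 because its documents are ranked higher. Other weightings (for example by rank position rather than by raw score) would be equally plausible.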

      Attachments

        1. LUCENE-6654.patch
          12 kB
          Alessandro Benedetti

          People

            teofili Tommaso Teofili
            abenedetti Alessandro Benedetti
            Votes: 0
            Watchers: 4
