[LUCENE-6654] KNearestNeighborClassifier not taking in consideration Class ranking - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Patch Available
Priority: Minor
Resolution: Fixed
Affects Version/s: 5.2.1
Fix Version/s: 5.3, 6.0
Component/s: modules/classification
Labels:
- classification
- knn

Lucene Fields:

Patch Available

Description

Currently the KNN Classifier assign the score for a ClassificationResult, based only on the frequency of the class in the top K results.

This is conceptually a simplification.
Actually the ranking must take a part.

If not this can happen :

Top 4
1) Class1
2) Class1
3) Class2
4) Class2

As a result of this Top 4 , both the classes will have the same score.
But the expected result is that Class1 has a better score, as the MLT score the documents accordingly.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

LUCENE-6654.patch
12/Jul/15 19:22
12 kB
Alessandro Benedetti

Activity

People

Assignee:: Tommaso Teofili

Reporter:: Alessandro Benedetti

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 02/Jul/15 13:10

Updated:: 19/Sep/24 09:37

Resolved:: 03/Aug/15 14:05