Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-7776

Switch KNN classifier to use BM25 similarity

Agile BoardAttach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 7.0
    • Component/s: modules/classification
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      It'd be good to use BM25 as default Similarity for KNN classifier.
      Having done some tests on the 20newsgroups dataset that resulted in improved f1 (between 0.10 and 0.15).

        Attachments

          Activity

            People

            • Assignee:
              teofili Tommaso Teofili
              Reporter:
              teofili Tommaso Teofili

              Dates

              • Created:
                Updated:
                Resolved:

                Issue deployment