Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-2557

FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

Details

    • Bug
    • Status: Reopened
    • Major
    • Resolution: Unresolved
    • 3.0.2
    • None
    • core/query/scoring
    • None
    • New

    Description

      The FuzzyQuery often causes misspellings to be ranked higher than the exact match, which seems to be an undesirable property generally.

      For example, in an index of surnames, if I search using a FuzzyQuery for "smith", the misspellings such as "smiith", or "smiht" would appear near the top of the search results ahead of documents that match "smith".

      Attachments

        1. LUCENE-2557.patch
          7 kB
          Jingkei Ly
        2. idf-scoring-test-case.patch
          3 kB
          Jingkei Ly

        Activity

          People

            Unassigned Unassigned
            jly Jingkei Ly
            Votes:
            1 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: