Lucene - Core / LUCENE-2557

FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

    Details

    • Type: Bug
    • Status: Reopened
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.0.2
    • Fix Version/s: None
    • Component/s: core/query/scoring
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      FuzzyQuery often ranks misspellings higher than the exact match, which seems generally undesirable.

      For example, in an index of surnames, if I search using a FuzzyQuery for "smith", misspellings such as "smiith" or "smiht" appear near the top of the search results, ahead of documents that match "smith".
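
The attachment name idf-scoring-test-case.patch suggests that inverse document frequency (IDF) is involved. The following is a minimal sketch, not Lucene code, of how that could produce the ranking described above. It assumes a classic-style IDF of 1 + ln(numDocs / (docFreq + 1)) and a simplified per-term score of boost * idf^2. The class name, index size, document frequencies, and the 0.6 boost for the fuzzy term are all hypothetical values chosen for illustration.

```java
public class FuzzyIdfSketch {

    // Classic-style inverse document frequency: the rarer the term, the larger the value.
    static double idf(int docFreq, int numDocs) {
        return 1.0 + Math.log((double) numDocs / (docFreq + 1));
    }

    // Simplified per-term score: boost * idf^2.
    // Term frequency, length norms and query normalization are deliberately left out.
    static double score(int docFreq, int numDocs, double boost) {
        double w = idf(docFreq, numDocs);
        return boost * w * w;
    }

    public static void main(String[] args) {
        int numDocs = 10_000;                         // hypothetical index size

        // "smith": common surname, exact match, full boost (assumed values).
        double exact = score(1_000, numDocs, 1.0);

        // "smiith": rare misspelling, reduced fuzzy boost (assumed values).
        double typo = score(1, numDocs, 0.6);

        System.out.println("exact match score : " + exact);
        System.out.println("misspelling score : " + typo);
        System.out.println("misspelling outranks exact: " + (typo > exact));
    }
}
```

Under these assumptions the rare misspelling's IDF outweighs its lower boost, so it scores above the exact term. Whether this matches the actual cause in 3.0.2 would need to be confirmed against the attached test case.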

      1. LUCENE-2557.patch
        7 kB
        Jingkei Ly
      2. idf-scoring-test-case.patch
        3 kB
        Jingkei Ly

          People

          • Assignee:
            Unassigned
            Reporter:
            Jingkei Ly
          • Votes:
            1
            Watchers:
            1
