Lucene - Core / LUCENE-2557

FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

    Details

    • Type: Bug
    • Status: Reopened
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.0.2
    • Fix Version/s: None
    • Component/s: core/query/scoring
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      FuzzyQuery often ranks misspellings higher than the exact match, which is generally undesirable.

      For example, in an index of surnames, if I search for "smith" using a FuzzyQuery, misspellings such as "smiith" or "smiht" appear near the top of the search results, ahead of documents that match "smith" exactly.
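
      The attachment name idf-scoring-test-case.patch suggests that inverse document frequency (IDF) is involved, although the description does not say so. The sketch below is a rough illustration of that reading, not the reporter's test case. It uses the idf form of Lucene's DefaultSimilarity, 1 + ln(numDocs / (docFreq + 1)), and assumes a simplified per-term weight of idf^2 * boost that ignores tf, norms, queryNorm and coord. The index size, document frequencies and the 0.6 boost for the misspelling are hypothetical values chosen only to show the effect.

```python
import math


def classic_idf(doc_freq: int, num_docs: int) -> float:
    """IDF in the form used by Lucene's DefaultSimilarity."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def term_weight(doc_freq: int, num_docs: int, boost: float) -> float:
    """Simplified weight (assumption): idf^2 * boost, all other factors ignored."""
    return classic_idf(doc_freq, num_docs) ** 2 * boost


# Hypothetical index of surnames.
NUM_DOCS = 100_000

# "smith" is common, so its idf is low; an exact match gets full boost.
exact = term_weight(doc_freq=5_000, num_docs=NUM_DOCS, boost=1.0)

# "smiith" is rare, so its idf is high; the fuzzy match is assumed to get
# a reduced boost because it is only similar to the query term.
typo = term_weight(doc_freq=2, num_docs=NUM_DOCS, boost=0.6)

print(f"exact match weight: {exact:.2f}")
print(f"misspelling weight: {typo:.2f}")
print("misspelling outranks exact match:", typo > exact)
```

      Under these assumptions the rare misspelling ends up with the larger weight, because the squared idf grows faster than the reduced boost can offset. That matches the ranking described above.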

      1. LUCENE-2557.patch
        7 kB
        Jingkei Ly
      2. idf-scoring-test-case.patch
        3 kB
        Jingkei Ly

        Activity

        Field: Original Value -> New Value
        Mark Thomas made changes -
        Workflow: Default workflow, editable Closed status [ 12562721 ] -> jira [ 12583650 ]
        Mark Thomas made changes -
        Workflow: jira [ 12516431 ] -> Default workflow, editable Closed status [ 12562721 ]
        Jingkei Ly made changes -
        Attachment: (none) -> LUCENE-2557.patch [ 12450325 ]
        Jingkei Ly made changes -
        Resolution: Duplicate [ 3 ] -> (none)
        Status: Resolved [ 5 ] -> Reopened [ 4 ]
        Robert Muir made changes -
        Status: Open [ 1 ] -> Resolved [ 5 ]
        Resolution: (none) -> Duplicate [ 3 ]
        Jingkei Ly made changes -
        Attachment: (none) -> idf-scoring-test-case.patch [ 12450320 ]
        Jingkei Ly created issue -

          People

          • Assignee:
            Unassigned
            Reporter:
            Jingkei Ly
          • Votes:
            1
            Watchers:
            1
