Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-8343

BlendedInfixSuggester bad score calculus for certain suggestion weights

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 7.3.1
    • Fix Version/s: 8.0
    • Component/s: core/search
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      Currently the BlendedInfixSuggester return a (long) score to rank the suggestions.
      This score is calculated as a multiplication between :

      long Weight : the suggestion weight, coming from a document field, it can be any long value ( including 1, 0,.. )

      double Coefficient : 0<=x<=1, calculated based on the position match, earlier the better

      The resulting score is a long, which means that at the moment, any weight<10 can bring inconsistencies.

      Edge cases 
      Weight =1
      Score = 1( if we have a match at the beginning of the suggestion) or 0 ( for any other match)

      Weight =0
      Score = 0 ( independently of the position match coefficient)

        Attachments

        1. LUCENE-8343.patch
          155 kB
          Alessandro Benedetti
        2. LUCENE-8343.patch
          151 kB
          Alessandro Benedetti
        3. LUCENE-8343.patch
          129 kB
          Alessandro Benedetti
        4. LUCENE-8343.patch
          8 kB
          Alessandro Benedetti
        5. LUCENE-8343.patch
          10 kB
          Alessandro Benedetti
        6. LUCENE-8343.patch
          8 kB
          Alessandro Benedetti

        Issue Links

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              abenedetti Alessandro Benedetti

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 0.5h
                0.5h

                  Issue deployment