Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-8510

Better maximum scores for boolean queries

Details

    • Improvement
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • None
    • None
    • New

    Description

      The maximum score of boolean queries is the sum of the maximum score of each clause. In the common case that each sub query runs on the same field, we might be able to get better estimates though. Say the query looks like "body:A body:B": if the maximum score of "body:A" is obtained for a document of length X and the maximum score of "body:B" is obtained for a document of length Y!=X then the sum of these maximum scores may never be reached: since both clauses have to match the same document, the maximum scores of each clause should be computed for the same value of the norm.

      Improving our APIs to improve this sounds challenging, but I was wondering that we could obtain almost the same result by enabling to sort by norm value, so that we would combine maximum scores of clauses that have the same norm.

      Attachments

        Activity

          People

            Unassigned Unassigned
            jpountz Adrien Grand
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: