  1. Lucene - Core
  2. LUCENE-8510

Better maximum scores for boolean queries

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels: None
    • Lucene Fields: New

      Description

      The maximum score of a boolean query is computed as the sum of the maximum scores of its clauses. In the common case where every sub-query runs on the same field, we might be able to get better estimates, though. Say the query looks like "body:A body:B": if the maximum score of "body:A" is obtained for a document of length X and the maximum score of "body:B" is obtained for a document of length Y != X, then the sum of these maximum scores may never be reached. Since both clauses have to match the same document, the maximum score of each clause should be computed for the same value of the norm.
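To make the gap concrete, here is a minimal sketch that compares the two bounds on a toy example. It assumes a textbook BM25 formula with the IDF fixed at 1.0 and uses made-up postings; the function names, constants, and data are hypothetical and are not Lucene APIs.

```python
from collections import defaultdict

K1, B = 1.2, 0.75  # common BM25 defaults, assumed here for illustration


def bm25(freq, doc_len, avg_len):
    """Textbook BM25 term score with the IDF fixed at 1.0 (a simplification)."""
    return freq * (K1 + 1) / (freq + K1 * (1 - B + B * doc_len / avg_len))


def max_score_by_length(postings, avg_len):
    """Map each document length to the best score the term reaches at that length."""
    best = defaultdict(float)
    for freq, doc_len in postings:
        best[doc_len] = max(best[doc_len], bm25(freq, doc_len, avg_len))
    return dict(best)


# Hypothetical (term frequency, document length) pairs for two terms on one field.
postings_a = [(3, 10), (1, 50)]  # term A scores best on the short document
postings_b = [(1, 10), (8, 50)]  # term B scores best on the long document
AVG_LEN = 30.0

per_len_a = max_score_by_length(postings_a, AVG_LEN)
per_len_b = max_score_by_length(postings_b, AVG_LEN)

# Current behaviour described above: sum the per-clause maxima,
# even though they come from documents of different lengths.
naive_bound = max(per_len_a.values()) + max(per_len_b.values())

# Tighter bound: only add scores that were computed for the same length.
tighter_bound = max(
    per_len_a.get(n, 0.0) + per_len_b.get(n, 0.0)
    for n in set(per_len_a) | set(per_len_b)
)

print(f"naive bound:   {naive_bound:.4f}")
print(f"tighter bound: {tighter_bound:.4f}")
```

With this data the naive bound is about 3.63 while the per-length bound is about 3.21, which is the best score any single document can actually reach.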

      Changing our APIs to support this sounds challenging, but I was wondering whether we could obtain almost the same result by making it possible to sort by norm value, so that we would combine maximum scores of clauses that have the same norm.
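One possible reading of this idea, not necessarily what the issue intends, is that maximum scores are tracked per block of consecutive documents, so ordering documents by norm would make each block cover a narrow range of lengths. The sketch below assumes that reading, a simplified BM25 with the IDF fixed at 1.0, and invented documents; none of the names correspond to Lucene APIs.

```python
K1, B, AVG_LEN = 1.2, 0.75, 30.0  # assumed BM25 parameters and average length


def bm25(freq, doc_len):
    """Textbook BM25 term score with the IDF fixed at 1.0 (a simplification)."""
    return freq * (K1 + 1) / (freq + K1 * (1 - B + B * doc_len / AVG_LEN))


def block_bound(docs, block_size):
    """Best 'max(A) + max(B)' over fixed-size blocks of consecutive documents."""
    best = 0.0
    for start in range(0, len(docs), block_size):
        block = docs[start:start + block_size]
        max_a = max(bm25(freq_a, n) for n, freq_a, freq_b in block)
        max_b = max(bm25(freq_b, n) for n, freq_a, freq_b in block)
        best = max(best, max_a + max_b)
    return best


# Hypothetical documents: (length, frequency of A, frequency of B).
docs = [(10, 3, 1), (50, 1, 8), (10, 2, 1), (50, 1, 5)]

# Arbitrary document order: each block mixes short and long documents.
unsorted_bound = block_bound(docs, block_size=2)

# Documents ordered by length: each block holds a single length.
sorted_bound = block_bound(sorted(docs, key=lambda d: d[0]), block_size=2)

# The score that the best document really reaches.
true_max = max(bm25(fa, n) + bm25(fb, n) for n, fa, fb in docs)

print(f"unsorted: {unsorted_bound:.4f}  sorted: {sorted_bound:.4f}  true: {true_max:.4f}")
```

In this toy case the bound computed on the length-ordered documents matches the true maximum, while the bound on the original order overestimates it. That outcome depends on the data and block size chosen here and is not a general guarantee.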

            People

            • Assignee: Unassigned
            • Reporter: Adrien Grand (jpountz)
            • Votes: 0
            • Watchers: 1
