Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-6184

BooleanScorer should better deal with sparse clauses

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 5.1, 6.0
    • None
    • None
    • New

    Description

      The way that BooleanScorer works looks like this:

      for each (window of 2048 docs) {
        for each (optional scorer) {
          scorer.score(window)
        }
      }
      

      This is not efficient for very sparse clauses (doc freq much lower than maxDoc/2048) since we keep on scoring windows of documents that do not match anything. BooleanScorer2 currently performs better in those cases.

      Attachments

        1. LUCENE-6184.patch
          28 kB
          Adrien Grand
        2. LUCENE-6184.patch
          16 kB
          Adrien Grand
        3. LUCENE-6184.patch
          16 kB
          Adrien Grand

        Activity

          People

            jpountz Adrien Grand
            jpountz Adrien Grand
            Votes:
            1 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: