Lucene - Core / LUCENE-4648

PostingsHighlighter should score only with within-document statistics

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 4.1
    • Fix Version/s: 4.1, 6.0
    • Component/s: modules/other
    • Labels: None
    • Lucene Fields: New

    Description

      PostingsHighlighter (in sandbox) treats documents to be highlighted as collections of small documents (snippets).

      But (like the other highlighters), it uses collection-wide stats to score the snippets. It actually calls the IndexSearcher method, which might even result in a distributed call. I don't think it's worth it to do all this. At a minimum, it means we have to re-seek to all terms across all segments, and it means snippets change over time as the collection-wide stats change.

      I think it's good enough to use only within-document stats, and it might speed up highlighting.
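
      To illustrate the idea, here is a minimal sketch of scoring snippets with within-document stats only. It is not the attached patch and uses no Lucene API: the class WithinDocScoring, the method scorePassages, and the smoothed IDF-style weight are all assumptions made for illustration. Each snippet is treated as a small document, and a term's weight comes from how many snippets of the same document contain it, so no collection-wide lookup is needed.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical helper for illustration; not part of Lucene or of the patch. */
final class WithinDocScoring {

  /**
   * Scores each passage (snippet) of ONE document against the query terms,
   * using only statistics computed from that document's own passages.
   */
  static double[] scorePassages(List<List<String>> passages, Set<String> queryTerms) {
    int numPassages = passages.size();

    // "Passage frequency": in how many passages of this document each query
    // term occurs. This stands in for collection-wide document frequency.
    Map<String, Integer> passageFreq = new HashMap<>();
    for (List<String> passage : passages) {
      Set<String> distinct = new HashSet<>(passage);
      for (String term : queryTerms) {
        if (distinct.contains(term)) {
          passageFreq.merge(term, 1, Integer::sum);
        }
      }
    }

    double[] scores = new double[numPassages];
    for (int i = 0; i < numPassages; i++) {
      for (String token : passages.get(i)) {
        Integer pf = passageFreq.get(token);
        if (pf != null) { // only query terms have an entry
          // Assumed smoothed IDF-style weight; rarer within the document = heavier.
          // Summing once per occurrence gives a simple linear term-frequency effect.
          scores[i] += Math.log(1.0 + (numPassages - pf + 0.5) / (pf + 0.5));
        }
      }
    }
    return scores;
  }
}
```

      Because every input comes from the document being highlighted, the scores in this sketch depend only on that document's content and the query, so the same snippets would be chosen regardless of what else is in the index.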

      Attachments

        1. LUCENE-4648.patch
          7 kB
          Robert Muir

          People

            Assignee: Unassigned
            Reporter: Robert Muir (rcmuir)