Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4648

PostingsHighlighter should score only with within-document statistics

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 4.1
    • Fix Version/s: 4.1, 6.0
    • Component/s: modules/other
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      PostingsHighlighter (in sandbox) treats documents to be highlighted as collections of small documents (snippets).

      But (like the other highlighters), it uses collection-wide stats to score the snippets. It actually calls the indexsearcher method which might even result in a distributed call. I don't think its worth it to do all this. It means we have to reseek to all terms across all segments at the minimum, and it means snippets change over time and so on.

      I think its good enough to only use within-document stats. And it might speed up highlighting.

        Attachments

        1. LUCENE-4648.patch
          7 kB
          Robert Muir

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              rcmuir Robert Muir
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: