Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-8087

Record per-term max term frequencies

    XMLWordPrintableJSON

Details

    • Wish
    • Status: Resolved
    • Minor
    • Resolution: Abandoned
    • None
    • None
    • None
    • None
    • New

    Description

      I was mostly interested in doing that in order to get better score upper bounds for LUCENE-4100. However this doesn't help, at least with the tasks that we have for wikimedium10m. I dug this a bit, and this is due to the fact that the upper bound is not much better if we can't make assumptions about the value of the length. Ideally we'd need something like the maximum term frequency for each norm value. I'll post the patch in case someone has another use-case for per-term max term frequencies.

      Attachments

        1. LUCENE-8087.patch
          74 kB
          Adrien Grand

        Issue Links

          Activity

            People

              Unassigned Unassigned
              jpountz Adrien Grand
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: