Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-8087

Record per-term max term frequencies

    XMLWordPrintableJSON

    Details

    • Type: Wish
    • Status: Resolved
    • Priority: Minor
    • Resolution: Abandoned
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      I was mostly interested in doing that in order to get better score upper bounds for LUCENE-4100. However this doesn't help, at least with the tasks that we have for wikimedium10m. I dug this a bit, and this is due to the fact that the upper bound is not much better if we can't make assumptions about the value of the length. Ideally we'd need something like the maximum term frequency for each norm value. I'll post the patch in case someone has another use-case for per-term max term frequencies.

        Attachments

        1. LUCENE-8087.patch
          74 kB
          Adrien Grand

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                jpountz Adrien Grand
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: