Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-577

SweetSpotSimiliarity

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 2.0.0
    • core/search
    • None

    Description

      This is a new Similarity implimention for the contrib/miscellaneous/ package, it provides a Similiarty designed for people who know the "sweetspot" of their data. three major pieces of functionality are included:

      1) a lengthNorm which creates a "platuea" of values.
      2) a baseline tf that provides a fixed value for tf's up to a minimum, at which point it becomes a sqrt curve (this is used by the tf(int) function.
      3) a hyperbolic tf function which is best explained by graphing the equation. this isn't used by default, but is available for subclasses to call from their own tf functions.

      All constants used in all functions are configurable. In the case of lengthNorm, the constants are configurable per field, as well as allowing for defaults for unspecified fields.

      Attachments

        1. ss.patch
          12 kB
          Chris M. Hostetter

        Activity

          People

            hossman Chris M. Hostetter
            hossman Chris M. Hostetter
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: