Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-1918

Bit-wise scoring field type

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.1
    • None
    • Schema and Analysis
    • None

    Description

      This field type supports bit-wise matching with scoring over fields containing 32-bit integer values.

      Internally during indexing the field value is interpreted as a 32-bit integer and converted to a set of bitmask terms. During search the query term is also converted into a BooleanQuery of corresponding bitmask terms, added as Occur.SHOULD clauses. The scoring of such query over such field produces the desired effect, where bit patterns closest to the query get highest scores.

      NOTE: currently only single value per field, a future extension will support multiple values.

      The difference between this issue and SOLR-1913 is that SOLR-1913 offers filtering only (no scoring), while this implementation performs scoring, which allows finding the best matching inexact bit patterns. This has applications in e.g. near-duplicate detection.

      Attachments

        1. bitwise.patch
          11 kB
          Andrzej Bialecki

        Activity

          People

            Unassigned Unassigned
            ab Andrzej Bialecki
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: