Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-1380

Patch for ShingleFilter.enablePositions (or PositionFilter)

Details

    • Improvement
    • Status: Closed
    • Trivial
    • Resolution: Fixed
    • None
    • 2.9
    • modules/analysis
    • None
    • New, Patch Available

    Description

      Make it possible for all words and shingles to be placed at the same position, that is for all shingles (and unigrams if included) to be treated as synonyms of each other.

      Today the shingles generated are synonyms only to the first term in the shingle.
      For example the query "abcd efgh ijkl" results in:
      ("abcd" "abcd efgh" "abcd efgh ijkl") ("efgh" efgh ijkl") ("ijkl")

      where "abcd efgh" and "abcd efgh ijkl" are synonyms of "abcd", and "efgh ijkl" is a synonym of "efgh".

      There exists no way today to alter which token a particular shingle is a synonym for.
      This patch takes the first step in making it possible to make all shingles (and unigrams if included) synonyms of each other.

      See http://comments.gmane.org/gmane.comp.jakarta.lucene.user/34746 for mailing list thread.

      Attachments

        1. LUCENE-1380-PositionFilter.patch
          9 kB
          Steven Rowe
        2. LUCENE-1380-PositionFilter.patch
          11 kB
          Michael Semb Wever
        3. LUCENE-1380-PositionFilter.patch
          9 kB
          Michael Semb Wever
        4. LUCENE-1380.patch
          6 kB
          Michael Semb Wever
        5. LUCENE-1380.patch
          6 kB
          Karl Wettin

        Issue Links

          Activity

            People

              gsingers Grant Ingersoll
              mck Michael Semb Wever
              Votes:
              14 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: