LUCENE-1380

Patch for ShingleFilter.enablePositions (or PositionFilter)

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.9
    • Component/s: modules/analysis
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      Make it possible for all words and shingles to be placed at the same position, that is, for all shingles (and unigrams, if included) to be treated as synonyms of each other.

      Today, each generated shingle is a synonym only of the first term in that shingle.
      For example, the query "abcd efgh ijkl" results in:
      ("abcd" "abcd efgh" "abcd efgh ijkl") ("efgh" "efgh ijkl") ("ijkl")

      where "abcd efgh" and "abcd efgh ijkl" are synonyms of "abcd", and "efgh ijkl" is a synonym of "efgh".
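
      The grouping above can be reproduced with a small, self-contained model. This is only a sketch of the positions described in this issue, not Lucene's ShingleFilter; the class ShinglePositionsSketch, its Tok type and the shingles method are hypothetical names introduced for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model of the positions described above; this is not Lucene's ShingleFilter.
final class ShinglePositionsSketch {

    /** A term together with the absolute position it occupies in the stream. */
    static final class Tok {
        final String term;
        final int position;

        Tok(String term, int position) {
            this.term = term;
            this.position = position;
        }

        @Override
        public String toString() {
            return "\"" + term + "\"@" + position;
        }
    }

    /** Emits each unigram and every shingle starting at it, all at the unigram's position. */
    static List<Tok> shingles(String[] words, int maxShingleSize) {
        List<Tok> out = new ArrayList<>();
        for (int start = 0; start < words.length; start++) {
            StringBuilder term = new StringBuilder();
            for (int size = 1; size <= maxShingleSize && start + size <= words.length; size++) {
                if (size > 1) {
                    term.append(' ');
                }
                term.append(words[start + size - 1]);
                // Every shingle shares the position of its first word.
                out.add(new Tok(term.toString(), start));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        for (Tok t : shingles(new String[] {"abcd", "efgh", "ijkl"}, 3)) {
            System.out.println(t);
        }
    }
}
```

      Tokens that share a position number are the ones a query treats as synonyms, which is why "efgh ijkl" ends up tied to "efgh" and not to "abcd".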

      There is currently no way to change which token a particular shingle is a synonym of.
      This patch takes the first step toward making all shingles (and unigrams, if included) synonyms of each other.
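
      A minimal sketch of one way to reach that goal, assuming tokens carry Lucene-style position increments (1 advances to a new position, 0 stacks on the previous one): keep the first token's increment and set every later increment to zero. The class PositionFlattenSketch and its methods are hypothetical and are not taken from the attached patches.

```java
import java.util.Arrays;

// Hypothetical sketch of position flattening; names are illustrative, not taken from the patch.
final class PositionFlattenSketch {

    /** Keeps the first increment and sets every later one to zero. */
    static int[] flatten(int[] increments) {
        int[] out = new int[increments.length];
        if (increments.length > 0) {
            out[0] = increments[0];
        }
        return out; // remaining entries are already 0
    }

    /** Converts position increments to absolute positions, starting from -1. */
    static int[] positions(int[] increments) {
        int[] out = new int[increments.length];
        int pos = -1;
        for (int i = 0; i < increments.length; i++) {
            pos += increments[i];
            out[i] = pos;
        }
        return out;
    }

    public static void main(String[] args) {
        // Increments for: abcd, "abcd efgh", "abcd efgh ijkl", efgh, "efgh ijkl", ijkl
        int[] before = {1, 0, 0, 1, 0, 1};
        System.out.println(Arrays.toString(positions(before)));          // [0, 0, 0, 1, 1, 2]
        System.out.println(Arrays.toString(positions(flatten(before)))); // [0, 0, 0, 0, 0, 0]
    }
}
```

      After flattening, all six tokens of the example query sit at position 0, so every unigram and shingle is a synonym of every other.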

      See http://comments.gmane.org/gmane.comp.jakarta.lucene.user/34746 for the mailing list thread.

        Attachments

        1. LUCENE-1380-PositionFilter.patch
          9 kB
          mck
        2. LUCENE-1380-PositionFilter.patch
          11 kB
          mck
        3. LUCENE-1380-PositionFilter.patch
          9 kB
          Steve Rowe
        4. LUCENE-1380.patch
          6 kB
          Karl Wettin
        5. LUCENE-1380.patch
          6 kB
          mck

              People

              • Assignee:
                Grant Ingersoll (gsingers)
              • Reporter:
                mck (michaelsembwever)
              • Votes:
                14
              • Watchers:
                1