Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-3574

Create a Compound Word Filter (and Factory) extension that will allow support for (word) exceptions

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      When having the following use case:
      We have 2 words "penslot" and "knoppen". One of them presents a compound word ("penslot"), the other one is a plural form of knop.

      When using the compound word filter, if we place the words "pen" "slot" and "knop" in the dictionary, for a search containing "knoppen", we get results containing "pen" also, which shouldn't be the case, because "knoppen" is only a plural form (not a compound word).

      We need another dictionary to specify the words that are exceptions to the filter (like in this case "knoppen"). In this case, the filter would find compound words containing "pen" "slot" and "knop", but will leave out dividing "knoppen" and searching on its parts.

      More info on the subject: http://stackoverflow.com/questions/11159839/can-we-make-the-compound-word-filter-not-divide-some-words-in-solr

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            despot Despot Jakimovski
            despot Despot Jakimovski

            Dates

              Created:
              Updated:

              Time Tracking

                Estimated:
                Original Estimate - 72h
                72h
                Remaining:
                Remaining Estimate - 72h
                72h
                Logged:
                Time Spent - Not Specified
                Not Specified

                Slack

                  Issue deployment