  1. Solr
  2. SOLR-1192

solr.NGramFilterFactory stops indexing the content if it finds a token shorter than the minimum n-gram size

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.3
    • Fix Version/s: 1.4
    • Component/s: Schema and Analysis
    • Labels:
      None
    • Environment:

      any

      Description

      If a field is split into tokens by a tokenizer and NGramFilterFactory is then applied to those tokens, indexing works as long as the length of each token is greater than or equal to the minimum n-gram size (usually 3). When a shorter token is encountered, indexing stops at that point and the remaining tokens are not indexed. This behaviour is easy to observe with the analysis tool in the Solr admin interface.
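
      The reported behaviour can be modelled with a small standalone sketch. This is not Solr's code: the function names (ngrams, reported_filter, expected_filter) are hypothetical, the minimum and maximum gram sizes are both assumed to be 3, and skipping short tokens is only one plausible reading of the expected behaviour.

```python
def ngrams(token, min_gram=3, max_gram=3):
    """Return every n-gram of token whose length is in [min_gram, max_gram]."""
    grams = []
    for size in range(min_gram, max_gram + 1):
        for start in range(len(token) - size + 1):
            grams.append(token[start:start + size])
    return grams


def reported_filter(tokens, min_gram=3, max_gram=3):
    """Model of the reported bug: output ends at the first too-short token."""
    out = []
    for token in tokens:
        grams = ngrams(token, min_gram, max_gram)
        if not grams:
            break  # assumption: mimics the stream ending early
        out.extend(grams)
    return out


def expected_filter(tokens, min_gram=3, max_gram=3):
    """Assumed expected behaviour: short tokens yield nothing, later ones still do."""
    out = []
    for token in tokens:
        out.extend(ngrams(token, min_gram, max_gram))
    return out


tokens = ["solr", "is", "fast"]  # "is" is shorter than the minimum gram size
print(reported_filter(tokens))   # ['sol', 'olr']
print(expected_filter(tokens))   # ['sol', 'olr', 'fas', 'ast']
```

      With the input above, the token "is" is shorter than the minimum gram size, so the modelled bug loses every n-gram of "fast" as well.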

              People

              • Assignee:
                otis Otis Gospodnetic
                Reporter:
                viobade viobade
              • Votes:
                0
                Watchers:
                0

                Dates

                • Created:
                  Updated:
                  Resolved: