Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Duplicate
-
1.4
-
None
Description
To see the problem first hand, add to schema.xml:
<fieldType name="cap" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <filter class="solr.CapitalizationFilterFactory" minWordLength="2"/> </analyzer> </fieldType>
Then in analysis.jsp, analyze "a or and". All terms get capitalized.
This has to do with the length calculation being wrong in CapitalizationFilter#next.