Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Not A Problem
Affects Version/s: 3.6
None
Environment: Windows 7 (development machine, not the server)
Description
I am using Solr 3.6, and when I have multi-word synonyms, the wrong word is highlighted in the highlighting results.
Suppose the synonyms file contains this entry:
dns, domain name system
and I index a document such as: "A sample dns entry explaining the details".
Searching for "name" (without quotes) returns this highlight snippet: "A sample dns <em>entry</em> explaining the details". (The token "entry" overlaps with the token "name" in analysis.jsp.)
Searching for "system" (without quotes) returns this highlight snippet: "A sample dns entry <em>explaining</em> the details". (The token "explaining" overlaps with the token "system" in analysis.jsp.)
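The overlap described above can be illustrated with a minimal Python sketch. This is a simplified model of how token positions might be assigned when a multi-word synonym is expanded at index time, not Solr's actual SynonymFilter code. The function name `expand_positions`, the whitespace tokenization, and the rule that each extra synonym word takes the next position are assumptions made for illustration only.

```python
def expand_positions(text, synonyms):
    """Map each token position to the tokens stacked at that position.

    Simplified model (assumption): whitespace tokenization, lowercasing,
    and each word of a multi-word synonym occupies consecutive positions
    starting at the position of the matched term.
    """
    positions = {}  # position -> list of tokens at that position
    tokens = text.lower().split()
    for pos, token in enumerate(tokens):
        positions.setdefault(pos, []).append(token)
        # Stack every word of each synonym phrase onto this and later positions.
        for phrase in synonyms.get(token, []):
            for offset, word in enumerate(phrase.split()):
                positions.setdefault(pos + offset, []).append(word)
    return positions


# Hypothetical stand-in for the "dns, domain name system" synonyms entry.
synonyms = {"dns": ["domain name system"]}
positions = expand_positions("A sample dns entry explaining the details", synonyms)
for pos in sorted(positions):
    print(pos, positions[pos])
```

In this model "name" ends up at the same position as "entry", and "system" at the same position as "explaining", which matches the overlap shown in analysis.jsp.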
Here is my schema fieldType:
<fieldType name="text_general" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <charFilter class="solr.HTMLStripCharFilterFactory"/>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="false"/>
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
</fieldType>