[SOLR-1852] enablePositionIncrements="true" can cause searches to fail when they are parsed as phrase queries - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 1.4
Fix Version/s: 1.4.1
Component/s: None
Labels:
None

Description

Symptom: searching for a string like a domain name containing a '.', the Solr 1.4 analyzer tells me that I will get a match, but when I enter the search either in the client or directly in Solr, the search fails.
test string: Identi.ca

queries that fail: IdentiCa, Identi.ca, Identi-ca

query that matches: Identi ca

schema in use is:
http://drupalcode.org/viewvc/drupal/contributions/modules/apachesolr/schema.xml?revision=1.1.2.1.2.34&content-type=text%2Fplain&view=co&pathrev=DRUPAL-6--1

Screen shots:

analysis: http://img.skitch.com/20100327-nt1uc1ctykgny28n8bgu99h923.png
dismax search: http://img.skitch.com/20100327-byiduuiry78caka7q5smsw7fp.png
dismax search: http://img.skitch.com/20100327-gckm8uhjx3t7px31ygfqc2ugdq.png
standard search: http://img.skitch.com/20100327-usqyqju1d12ymcpb2cfbtdwyh.png

Whether or not the bug appears is determined by the surrounding text:

"would be great to have support for Identi.ca on the follow block"

fails to match "Identi.ca", but putting the content on its own or in another sentence:

"Support Identi.ca"

the search matches. Testing suggests the word "for" is the problem, and it looks like the bug occurs when a stop word preceeds a word that is split up using the word delimiter filter.

Setting enablePositionIncrements="false" in the stop filter and reindexing causes the searches to match.

According to Mark Miller in #solr, this bug appears to be fixed already in Solr trunk, either due to the upgraded lucene or changes to the WordDelimiterFactory

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

SOLR-1852.patch
27/Mar/10 23:36
51 kB
Peter Wolanin
SOLR-1852_testcase.patch
28/Mar/10 13:03
2 kB
Robert Muir
SOLR-1852_solr14branch.patch
02/Jun/10 03:35
54 kB
Robert Muir

Activity

People

Assignee:: Robert Muir

Reporter:: Peter Wolanin

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 27/Mar/10 23:04

Updated:: 16/Sep/10 19:59

Resolved:: 02/Jun/10 18:35