Solr
  1. Solr
  2. SOLR-1852

enablePositionIncrements="true" can cause searches to fail when they are parsed as phrase queries

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 1.4
    • Fix Version/s: 1.4.1
    • Component/s: None
    • Labels:
      None

      Description

      Symptom: searching for a string like a domain name containing a '.', the Solr 1.4 analyzer tells me that I will get a match, but when I enter the search either in the client or directly in Solr, the search fails.
      test string: Identi.ca

      queries that fail: IdentiCa, Identi.ca, Identi-ca

      query that matches: Identi ca

      schema in use is:
      http://drupalcode.org/viewvc/drupal/contributions/modules/apachesolr/schema.xml?revision=1.1.2.1.2.34&content-type=text%2Fplain&view=co&pathrev=DRUPAL-6--1

      Screen shots:

      analysis: http://img.skitch.com/20100327-nt1uc1ctykgny28n8bgu99h923.png
      dismax search: http://img.skitch.com/20100327-byiduuiry78caka7q5smsw7fp.png
      dismax search: http://img.skitch.com/20100327-gckm8uhjx3t7px31ygfqc2ugdq.png
      standard search: http://img.skitch.com/20100327-usqyqju1d12ymcpb2cfbtdwyh.png

      Whether or not the bug appears is determined by the surrounding text:

      "would be great to have support for Identi.ca on the follow block"

      fails to match "Identi.ca", but putting the content on its own or in another sentence:

      "Support Identi.ca"

      the search matches. Testing suggests the word "for" is the problem, and it looks like the bug occurs when a stop word preceeds a word that is split up using the word delimiter filter.

      Setting enablePositionIncrements="false" in the stop filter and reindexing causes the searches to match.

      According to Mark Miller in #solr, this bug appears to be fixed already in Solr trunk, either due to the upgraded lucene or changes to the WordDelimiterFactory

      1. SOLR-1852_solr14branch.patch
        54 kB
        Robert Muir
      2. SOLR-1852_testcase.patch
        2 kB
        Robert Muir
      3. SOLR-1852.patch
        51 kB
        Peter Wolanin

        Activity

          People

          • Assignee:
            Robert Muir
            Reporter:
            Peter Wolanin
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development