Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-132

Add "an" to the English stop words

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • None
    • modules/analysis
    • None
    • Operating System: All
      Platform: PC

    • 28960

    Description

      In org.apache.lucene.analysis.StopAnalyzer, the ENGLISH_STOP_WORDS array
      contains "a" but not "an". So searching for "a fund" will get the same hits as
      "fund", but searching for "an investment" will get many more hits than "investment".

      This is true in the latest revision of the file, but appears to have always been
      the case. I'm amazed nobody's pointed it out before now, our users had only
      been testing for a few hours before they complained about it

      Attachments

        Activity

          People

            java-dev@lucene.apache.org Lucene Developers
            ats37@hotmail.com Andrew Stevens
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: