Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-132

Add "an" to the English stop words

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: modules/analysis
    • Labels:
      None
    • Environment:

      Operating System: All
      Platform: PC

    • Bugzilla Id:
      28960

      Description

      In org.apache.lucene.analysis.StopAnalyzer, the ENGLISH_STOP_WORDS array
      contains "a" but not "an". So searching for "a fund" will get the same hits as
      "fund", but searching for "an investment" will get many more hits than "investment".

      This is true in the latest revision of the file, but appears to have always been
      the case. I'm amazed nobody's pointed it out before now, our users had only
      been testing for a few hours before they complained about it

        Attachments

          Activity

            People

            • Assignee:
              java-dev@lucene.apache.org Lucene Developers
              Reporter:
              ats37@hotmail.com Andrew Stevens
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: