Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-10008

CommonGramsFilterFactory doesn't respect ignoreCase=true when default stopwords are used

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 9.0, 8.11
    • None
    • None
    • New

    Description

      CommonGramsFilterFactory's use of the "words" and "ignoreCase" config options is inconsistent with how StopFilterFactory uses them - leading to "ignoreCase=true" not being respected unless "words" is specified...

      StopFilterFactory...

        public void inform(ResourceLoader loader) throws IOException {
          if (stopWordFiles != null) {
            ...
          } else {
            ...
            stopWords = new CharArraySet(EnglishAnalyzer.ENGLISH_STOP_WORDS_SET, ignoreCase);
          }
        }
      

      CommonGramsFilterFactory...

        @Override
        public void inform(ResourceLoader loader) throws IOException {
          if (commonWordFiles != null) {
            ...
          } else {
            commonWords = EnglishAnalyzer.ENGLISH_STOP_WORDS_SET;
          }
        }
      

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              hossman Chris M. Hostetter
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 40m
                  40m