Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-3097

Introduce default Japanese stoptags and stopwords to Solr's example configuration

Agile BoardAttach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 3.6, 4.0-ALPHA
    • 3.6, 4.0-ALPHA
    • Schema and Analysis
    • None

    Description

      SOLR-3056 discusses introducing a default field type text_ja for Japanese in schema.xml. This configuration will be improved by also introducing default stopwords and stoptags configuration for the field type.

      I believe this configuration should be easily available and tunable to Solr users and I'm proposing that we introduce the same stopwords and stoptags provided in LUCENE-3745 to Solr example configuration. I'm proposing that files can live in solr/example/solr/conf as stopwords_ja.txt and stoptags_ja.txt alongside stopwords_en.txt for English. (Longer term, I think should reconsider our overall approach to this across all languages, but that's perhaps a separate discussion.)

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            cm Christian Moen
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment