Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-14643

Edit Ref Guide about How to Configure Tokenizing Managed Synonyms

    XMLWordPrintableJSON

Details

    • Task
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • documentation
    • None

    Description

      SynonymGraphFilter and SynonymFileter can configure a tokenizer to tokenize synonyms in text files. 

      The configuration is below:

       
      <filter class="solr.SynonymGraphFilterFactory"
                  ignoreCase="true"
                  synonyms="synonyms.txt"
                  tokenizerFactory="solr.[Name of Tokenizer]"/>

      For ManagedSynonymGraphFilter, even though it can configure tokenizer in the Managed Resource JSON file, there does not exists any explanation about tokenizer configuration. 
      For users to choose ManagedSynonymGraphFilter seamlessly, Solr reference guide should contain the configuration guide. 
       
      When a user want to tokenize their synonyms, which is used by ManagedSynonymGraphFilter, then the user should edit Managed Resource JSON like below. 

      { "responseHeader":

      {   "status":0, "QTime":3}


        "synonymMappings":{ "
       initArgs":

      {      "ignoreCase":true,        "format":"solr",        "tokenizerFactory":"solr.<Name Of Tokenizer>Factory"    }


            "initializedOn":"2014-12-16T22:44:05.33Z", 
           "managedMap":

      {          "GB": ["GiB", "Gigabyte"],         "TV": ["Television"],         "happy": ["glad", "joyful"]}

      }}
       
      Also, it might be nice if we could edit Managed Resource JSON via REST API.
      (This could be another issue.)
       

      Attachments

        Activity

          People

            Unassigned Unassigned
            kayak28 kayak28
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: