Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-2628

use of FST for SynonymsFilterFactory and synonyms.txt

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: 3.4, 4.0-ALPHA
    • Fix Version/s: None
    • Component/s: Schema and Analysis
    • Labels:
    • Environment:

      Linux

      Description

      Currently the SynonymsFilterFactory builds up a memory based SynonymsMap.
      This can generate huge maps because of the permutations for synonyms.

      Now where FST (finite state transducer) is introduced to lucene this could also be used for synonyms.
      A tool can compile the synoynms.txt file to a binary automaton file which can then be used
      with SynoynmsFilterFactory.

      Advantage:

      • faster start of solr, no need to generate SynonymsMap
      • faster lookup
      • memory saving

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                dweiss Dawid Weiss
                Reporter:
                befehl Bernd Fehling
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: