Uploaded image for project: 'Lucy'
  1. Lucy
  2. LUCY-132

Move Tokenizer, Stemmer, Stopalizer

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 0.1.0 (incubating)
    • 0.1.0 (incubating)
    • Analysis
    • None

    Description

      In anticipation of creating interfaces using generic names such as
      "Tokenizer", "Stemmer", and "StopFilter", we should vacate those class names
      and give the current implementations more specific names:

          Lucy::Analysis::Tokenizer  => Lucy::Analysis::RegexTokenizer
          Lucy::Analysis::Stemmer    => Lucy::Analysis::SnowballStemmer
          Lucy::Analysis::Stopalizer => Lucy::Analysis::SnowballStopFilter
      

      Discussion: <http://s.apache.org/RbX>, <http://s.apache.org/WVJ>.

      Attachments

        1. snowball_stopfilter.patch
          44 kB
          Marvin Humphrey
        2. snowball_stemmer.patch
          34 kB
          Marvin Humphrey
        3. regex_tokenizer.patch
          63 kB
          Marvin Humphrey

        Activity

          People

            marvin Marvin Humphrey
            marvin Marvin Humphrey
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: