Lucy / LUCY-132

Move Tokenizer, Stemmer, Stopalizer

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.1.0 (incubating)
    • Fix Version/s: 0.1.0 (incubating)
    • Component/s: Analysis
    • Labels: None

      Description

      In anticipation of creating interfaces using generic names such as
      "Tokenizer", "Stemmer", and "StopFilter", we should vacate those class names
      and give the current implementations more specific names:

          Lucy::Analysis::Tokenizer  => Lucy::Analysis::RegexTokenizer
          Lucy::Analysis::Stemmer    => Lucy::Analysis::SnowballStemmer
          Lucy::Analysis::Stopalizer => Lucy::Analysis::SnowballStopFilter
      

      Discussion: <http://s.apache.org/RbX>, <http://s.apache.org/WVJ>.
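
      The rename mapping above can be applied mechanically to code that still
      uses the old class names. The following is a minimal sketch, not part of
      the attached patches: the helper name `migrate_class_names` and the choice
      of Python are assumptions made for illustration, and only the three class
      names listed in this issue are handled.

```python
import re

# Old-to-new class names, copied from the mapping in the description.
RENAMES = {
    "Lucy::Analysis::Tokenizer": "Lucy::Analysis::RegexTokenizer",
    "Lucy::Analysis::Stemmer": "Lucy::Analysis::SnowballStemmer",
    "Lucy::Analysis::Stopalizer": "Lucy::Analysis::SnowballStopFilter",
}

# Match only complete class names: not preceded by a word character or ':',
# and not followed by a word character, so longer names are left alone.
_PATTERN = re.compile(
    r"(?<![\w:])(" + "|".join(re.escape(name) for name in RENAMES) + r")(?!\w)"
)


def migrate_class_names(source: str) -> str:
    """Hypothetical helper: rewrite old analyzer class names in source text."""
    return _PATTERN.sub(lambda m: RENAMES[m.group(1)], source)


if __name__ == "__main__":
    old = "my $stemmer = Lucy::Analysis::Stemmer->new( language => 'en' );"
    print(migrate_class_names(old))
    # prints: my $stemmer = Lucy::Analysis::SnowballStemmer->new( language => 'en' );
```

      Because the new names never contain an old name as a complete class
      name, running the helper twice on the same text leaves it unchanged.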

        Attachments

        1. snowball_stemmer.patch
          34 kB
          Marvin Humphrey
        2. snowball_stopfilter.patch
          44 kB
          Marvin Humphrey
        3. regex_tokenizer.patch
          63 kB
          Marvin Humphrey

            People

            • Assignee: Marvin Humphrey (marvin)
            • Reporter: Marvin Humphrey (marvin)