Uploaded image for project: 'Stanbol (Retired)'
  1. Stanbol (Retired)
  2. STANBOL-849

Implement Lucene Tokenizer based LabelTokenizer

    XMLWordPrintableJSON

Details

    Description

      Lucene supports Tokenizers for a lot of languages. While the OpenNLP or Whitespace character based Tokenizers are fine for most of the languages this allows users to use special one (e.g. for Chinese the smartcn analyzer package)

      Attachments

        Activity

          People

            rwesten Rupert Westenthaler
            rwesten Rupert Westenthaler
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: