Uploaded image for project: 'Stanbol'
  1. Stanbol
  2. STANBOL-733 Stanbol NLP processing
  3. STANBOL-797

Adapt the OpenNLP NER engine to support the AnalyzedText ContentPart

    Details

      Description

      Currently the OpenNLP NER engine consumes the plain text, detects sentences and tokenizes the text. However if an EnhancementChain does use the NLP processing module those data are likely already present in the AnalyzedText ContentPart.

      This will add functionality that allows the OpenNLP NER engine to consume those data. If no AnalyzedText ContentPart is present or the AnalyzedText does not contain Token information the engine should still perform tokenization and sentence detection

        Activity

        Hide
        rwesten Rupert Westenthaler added a comment -

        NerTags are now added to the AnalyzedText content part

        Show
        rwesten Rupert Westenthaler added a comment - NerTags are now added to the AnalyzedText content part
        Hide
        rwesten Rupert Westenthaler added a comment -

        Add functionality to add NLP annotations also to the AnalyzedContent part so that other engines that prefer to use the NLP API can consume them without accessing the RDF enhancement structure

        Show
        rwesten Rupert Westenthaler added a comment - Add functionality to add NLP annotations also to the AnalyzedContent part so that other engines that prefer to use the NLP API can consume them without accessing the RDF enhancement structure
        Hide
        rwesten Rupert Westenthaler added a comment -
        Show
        rwesten Rupert Westenthaler added a comment - implementation provided with http://svn.apache.org/viewvc?rev=1406168&view=rev

          People

          • Assignee:
            rwesten Rupert Westenthaler
            Reporter:
            rwesten Rupert Westenthaler
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development