Uploaded image for project: 'Stanbol (Retired)'
  1. Stanbol (Retired)
  2. STANBOL-733 Stanbol NLP processing
  3. STANBOL-740

Adopt the KeywordLinkingEngine to use the AnalyzedText content part

    XMLWordPrintableJSON

Details

    Description

      The KeywordLinkingEngine currently does both NLP processing AND linking against the target vocabulary. Up to now this was the only possibility as separating those two things was not feasible with the limitations of the RDF metadata.

      With the introduction of the AnalyzedText content part the NLP processing part needs no longer be part of the KeywordLinkingEngine.

      This issue covers

      • removal of the NLP related functionality from the KeywordLinkingEngine
      • reimplementation of the linking part on top of the API provided by the AnalyzedText contentpart
      • add support fore new features of the NLP chain
      • use lemmas - if available - for entity lookup
      • use POS tagset mappings to the OLIA ontology to decide what tokens to lookup

      After this change the KeywordLinkingEngine will also be able to work in combination with any NLP framework that is integrated with the Stanbol NLP components (writes its data to the AnalyzedText content part).

      Attachments

        Issue Links

          Activity

            People

              rwesten Rupert Westenthaler
              rwesten Rupert Westenthaler
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: