Uploaded image for project: 'Stanbol (Retired)'
  1. Stanbol (Retired)
  2. STANBOL-869

Entities with unicode escaped chars ('\u????') in the URI are not indexed

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • entityhub-0.11.0
    • Entityhub
    • None

    Description

      When reading Entity URIs with a EntityIterator implementation of the Entityhub Indexing Tool unicode escaped chars are not converted to their UTF representation. Because of that Entities with such URIs might not be found by the EntityDataProvider implementation.

      For the JenaTDB indexing source this is the case and because of that any DBpedia entity that does use an unicode escaped character in its URI is currently not indexed.

      The EntityDataIterable implementation is not affected by this. Therefore given the currently used default configuration this will mainly affect the dbpedia indexing tool configuration and not users that use the generic RDF indexing tool configuration.

      Attachments

        Activity

          People

            rwesten Rupert Westenthaler
            rwesten Rupert Westenthaler
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: