Uploaded image for project: 'Apache Jena'
  1. Apache Jena
  2. JENA-785

exception getting substr() from Deseret string

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • Jena 2.12.1
    • None
    • None
    • Ubuntu 14.04 amd64

      $ java -version
      java version "1.7.0_65"
      OpenJDK Runtime Environment (IcedTea 2.5.1) (7u65-2.5.1-4ubuntu1~0.14.04.2)
      OpenJDK 64-Bit Server VM (build 24.65-b04, mixed mode)

    Description

      I stumbled on a problem with a SPARQL query, run against Fuseki 1.1.0 and the latest 1.1.1 snapshot. This is a minimal example demonstrating the problem.

      I have the following single triple (based on Lexvo.org) as my dataset:

      <http://lexvo.org/id/iso3166/AQ> <http://www.w3.org/2000/01/rdf-schema#label> "\U00010408\U0001044C\U0001043B\U0001042A\U00010449\U0001043F\U0001043B\U0001042E\U0001043F\U00010432"@en-Dsrt .
      

      (this is a label in the rather obscure Deseret alphabet - see wikipedia for details about the script)

      Then I ran this query:

      SELECT ?label (SUBSTR(?label, 1, 1) as ?l)
      WHERE { ?s <http://www.w3.org/2000/01/rdf-schema#label> ?label }
      

      With the text output, I get this:

      ------------------------------------------------
      | label                          | l           |
      ================================================
      | "𐐈𐑌𐐻𐐪𐑉𐐿𐐻𐐮𐐿𐐲"@en-Dsrt | " |
      ------------------------------------------------
      

      (you may need to install a Deseret font if you can't see the letters above. I use DeseretBee from here: http://copper.chem.ucla.edu/~jericks/Fonts/Bee%20Fonts/Sans%20Serif/_DeseretBee2.ttf)

      The result is not completely wrong, but there's something fishy about the ?l binding - it only shows one quote.

      With output set to JSON, XML, CSV or TSV, I get this error instead:

      Error 500: java.nio.charset.MalformedInputException: Input length = 1
      Fuseki - version 1.1.1-SNAPSHOT (Build date: 2014-09-16T13:16:19+0000)
      

      As you can see I'm using the latest Fuseki snapshot from yesterday.

      Attachments

        Activity

          People

            rvesse Rob Vesse
            osma Osma Suominen
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: