Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
Ubuntu 14.04 amd64
$ java -version
java version "1.7.0_65"
OpenJDK Runtime Environment (IcedTea 2.5.1) (7u65-2.5.1-4ubuntu1~0.14.04.2)
OpenJDK 64-Bit Server VM (build 24.65-b04, mixed mode)
Description
I stumbled on a problem with a SPARQL query, run against Fuseki 1.1.0 and the latest 1.1.1 snapshot. This is a minimal example demonstrating the problem.
I have the following single triple (based on Lexvo.org) as my dataset:
<http://lexvo.org/id/iso3166/AQ> <http://www.w3.org/2000/01/rdf-schema#label> "\U00010408\U0001044C\U0001043B\U0001042A\U00010449\U0001043F\U0001043B\U0001042E\U0001043F\U00010432"@en-Dsrt .
(this is a label in the rather obscure Deseret alphabet - see wikipedia for details about the script)
Then I ran this query:
SELECT ?label (SUBSTR(?label, 1, 1) as ?l) WHERE { ?s <http://www.w3.org/2000/01/rdf-schema#label> ?label }
With the text output, I get this:
------------------------------------------------ | label | l | ================================================ | "𐐈𐑌𐐻𐐪𐑉𐐿𐐻𐐮𐐿𐐲"@en-Dsrt | " | ------------------------------------------------
(you may need to install a Deseret font if you can't see the letters above. I use DeseretBee from here: http://copper.chem.ucla.edu/~jericks/Fonts/Bee%20Fonts/Sans%20Serif/_DeseretBee2.ttf)
The result is not completely wrong, but there's something fishy about the ?l binding - it only shows one quote.
With output set to JSON, XML, CSV or TSV, I get this error instead:
Error 500: java.nio.charset.MalformedInputException: Input length = 1 Fuseki - version 1.1.1-SNAPSHOT (Build date: 2014-09-16T13:16:19+0000)
As you can see I'm using the latest Fuseki snapshot from yesterday.