Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
None
-
None
Description
whilst using the any23.org service today, I recently came across an issue whilst parsing the following URL
https://www.rottentomatoes.com/m/the_lord_of_the_rings_the_fellowship_of_the_ring/
<?xml version="1.0" encoding="UTF-8" ?> <report> <message>Internal error.</message> <error> <![CDATA[ java.lang.IllegalArgumentException: Illegal character in path at index 6: Action & Adventure at java.net.URI.create(URI.java:852) at java.net.URI.resolve(URI.java:1036) at com.github.jsonldjava.utils.JsonLdUrl.resolve(JsonLdUrl.java:274) at com.github.jsonldjava.core.Context.expandIri(Context.java:538) at com.github.jsonldjava.core.Context.expandValue(Context.java:1099) at com.github.jsonldjava.core.JsonLdApi.expand(JsonLdApi.java:979) at com.github.jsonldjava.core.JsonLdApi.expand(JsonLdApi.java:517) at com.github.jsonldjava.core.JsonLdApi.expand(JsonLdApi.java:819) at com.github.jsonldjava.core.JsonLdApi.expand(JsonLdApi.java:997) at com.github.jsonldjava.core.JsonLdProcessor.expand(JsonLdProcessor.java:146) at com.github.jsonldjava.core.JsonLdProcessor.toRDF(JsonLdProcessor.java:485) at org.eclipse.rdf4j.rio.jsonld.JSONLDParser.parse(JSONLDParser.java:68) at org.apache.any23.extractor.rdf.BaseRDFExtractor.run(BaseRDFExtractor.java:105) at org.apache.any23.extractor.html.EmbeddedJSONLDExtractor.extractJSONLDScript(EmbeddedJSONLDExtractor.java:147) at org.apache.any23.extractor.html.EmbeddedJSONLDExtractor.run(EmbeddedJSONLDExtractor.java:82) at org.apache.any23.extractor.html.EmbeddedJSONLDExtractor.run(EmbeddedJSONLDExtractor.java:53) at org.apache.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:476) at org.apache.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:256) at org.apache.any23.Any23.extract(Any23.java:300) at org.apache.any23.Any23.extract(Any23.java:452) at org.apache.any23.servlet.WebResponder.runExtraction(WebResponder.java:117) at org.apache.any23.servlet.Servlet.doGet(Servlet.java:82) at javax.servlet.http.HttpServlet.service(HttpServlet.java:635) at javax.servlet.http.HttpServlet.service(HttpServlet.java:742) at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:231) at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:166) at org.apache.tomcat.websocket.server.WsFilter.doFilter(WsFilter.java:53) at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:193) at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:166) at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:199) at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:96) at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBase.java:475) at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:140) at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:80) at org.apache.catalina.valves.AbstractAccessLogValve.invoke(AbstractAccessLogValve.java:624) at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:87) at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:342) at org.apache.coyote.ajp.AjpProcessor.service(AjpProcessor.java:403) at org.apache.coyote.AbstractProcessorLight.process(AbstractProcessorLight.java:66) at org.apache.coyote.AbstractProtocol$ConnectionHandler.process(AbstractProtocol.java:796) at org.apache.tomcat.util.net.NioEndpoint$SocketProcessor.doRun(NioEndpoint.java:1366) at org.apache.tomcat.util.net.SocketProcessorBase.run(SocketProcessorBase.java:49) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at org.apache.tomcat.util.threads.TaskThread$WrappingRunnable.run(TaskThread.java:61) at java.lang.Thread.run(Thread.java:748) Caused by: java.net.URISyntaxException: Illegal character in path at index 6: Action & Adventure at java.net.URI$Parser.fail(URI.java:2848) at java.net.URI$Parser.checkChars(URI.java:3021) at java.net.URI$Parser.parseHierarchical(URI.java:3105) at java.net.URI$Parser.parse(URI.java:3063) at java.net.URI.<init>(URI.java:588) at java.net.URI.create(URI.java:850) ... 45 more ]]> </error> <issueReport>
I would like to upgrade jsonld-java to see if the issue is fixed. If not then I will report it and submit a fix at https://github.com/jsonld-java/jsonld-java
Attachments
Issue Links
- links to