Uploaded image for project: 'Apache Any23 (Retired)'
  1. Apache Any23 (Retired)
  2. ANY23-44

error when parsing a document from http://www.afdsi.org/docs/test/html/RDFa/_food-stream_.htm

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • 0.7.0
    • 0.8.0
    • None
    • None

    Description

      This document causing the following exception

      http://any23.org/any23/?format=best&uri=http%3A%2F%2Fwww.afdsi.org%2Fdocs%2Ftest%2Fhtml%2FRDFa%2F_food-stream_.htm

      Could not parse input.
      ================================================================
      org.deri.any23.extractor.ExtractionException: An error occurred during the XSLT application.
      	at org.deri.any23.extractor.rdfa.RDFaExtractor.run(RDFaExtractor.java:138)
      	at org.deri.any23.extractor.rdfa.RDFaExtractor.run(RDFaExtractor.java:49)
      	at org.deri.any23.extractor.SingleDocumentExtraction.runExtractor(SingleDocumentExtraction.java:422)
      	at org.deri.any23.extractor.SingleDocumentExtraction.run(SingleDocumentExtraction.java:212)
      	at org.deri.any23.Any23.extract(Any23.java:279)
      	at org.deri.any23.Any23.extract(Any23.java:441)
      	at org.deri.any23.servlet.WebResponder.runExtraction(WebResponder.java:98)
      	at org.deri.any23.servlet.Servlet.doGet(Servlet.java:71)
      	at javax.servlet.http.HttpServlet.service(HttpServlet.java:617)
      	at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
      	at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:290)
      	at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
      	at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
      	at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
      	at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
      	at com.googlecode.psiprobe.Tomcat60AgentValve.invoke(Tomcat60AgentValve.java:30)
      	at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
      	at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
      	at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:298)
      	at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:857)
      	at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:588)
      	at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489)
      	at java.lang.Thread.run(Thread.java:662)
      Caused by: org.deri.any23.extractor.rdfa.XSLTStylesheetException:  An error occurred during the XSLT transformation
      	at org.deri.any23.extractor.rdfa.XSLTStylesheet.applyTo(XSLTStylesheet.java:93)
      	at org.deri.any23.extractor.rdfa.XSLTStylesheet.applyTo(XSLTStylesheet.java:63)
      	at org.deri.any23.extractor.rdfa.RDFaExtractor.run(RDFaExtractor.java:136)
      	... 22 more
      Caused by: javax.xml.transform.TransformerException: java.lang.RuntimeException: An attribute whose value must be a QName had the value 'dct:relation:hasPart'
      	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transform(TransformerImpl.java:716)
      	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transform(TransformerImpl.java:313)
      	at org.deri.any23.extractor.rdfa.XSLTStylesheet.applyTo(XSLTStylesheet.java:79)
      	... 24 more
      Caused by: java.lang.RuntimeException: An attribute whose value must be a QName had the value 'dct:relation:hasPart'
      	at com.sun.org.apache.xalan.internal.xsltc.runtime.BasisLibrary.runTimeError(BasisLibrary.java:1523)
      	at com.sun.org.apache.xalan.internal.xsltc.runtime.BasisLibrary.runTimeError(BasisLibrary.java:1527)
      	at com.sun.org.apache.xalan.internal.xsltc.runtime.BasisLibrary.checkQName(BasisLibrary.java:1381)
      	at GregorSamsa.property()
      	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$1()
      	at GregorSamsa.applyTemplates1()
      	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$23()
      	at GregorSamsa.applyTemplates1()
      	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$23()
      	at GregorSamsa.applyTemplates1()
      	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$23()
      	at GregorSamsa.applyTemplates1()
      	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$23()
      	at GregorSamsa.applyTemplates1()
      	at GregorSamsa.http$colon$$slash$$slash$www$dot$w3$dot$org$slash$1999$slash$XSL$slash$Transform$colon$template$dot$0()
      	at GregorSamsa.applyTemplates()
      	at GregorSamsa.transform()
      	at com.sun.org.apache.xalan.internal.xsltc.runtime.AbstractTranslet.transform(AbstractTranslet.java:603)
      	at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerImpl.transform(TransformerImpl.java:709)
      	... 26 more
      ================================================================
      

      as I do not know how to attache a file will paste the content of the file below (in case it is modified or removed from the web )

      <?xml version="1.0" encoding="utf-8"?>
      <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.1//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-2.dtd">
      <html 
       xmlns:x="http://www.w3.org/1999/xhtml"
       xmlns:v="http://rdf.data-vocabulary.org/rdf.xml"
       xmlns:dc="http://purl.org/dc/elements/1.1/"
       xmlns:dct="http://purl.org/dc/terms/"
       xmlns:dctype="http://purl.org/dc/dcmitype/"
       xmlns:agmes="http://purl.org/agmes/1.1/"
       version="XHTML+RDFa 1.1"
       xml:lang="en"
      >
      <head>
      <meta http-equiv="Content-Script-Type" content="text/javascript" />
      <meta http-equiv="Content-Type" content="application/xhtml+xml; charset=utf-8" />
      <link rel="stylesheet" type="text/css" href="http://208.109.87.107/public/3a/b4/1c/_html-elements_12-01-25v1_.css" />
      <link rel="shortcut icon" href="http://208.109.87.107/public/3a/b2/1c/favicon.ico" />
      <title>Food Stream</title>
      <link rel="profile" href="http://microformats.org/profile/hcard" />
      </head>
      <body>
      <h1>Documenting a graphic</h1>
      <dl class="dublincore">
      <dt>Title</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:title">Food Stream</span></dd>
      <dt>Creator:</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:creator">Gig Graham</span></dd>
      <dt>Contributor</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:contributor">Garrett Long</span></dd>
      <dt>Product:</dt>
      <dd><span vocab="http://rdf.data-vocabulary.org/#" typeof="v:Product">Product text here</span></dd>
      <dd><span property="v:brand">Product brand text here</span></dd>
      <dd><span property="v:description">Product description text here</span></dd>
      <dt>Offer:</dt>
      <dd><span vocab="http://rdf.data-vocabulary.org/#" typeof="v:Offer">Offer text here</span></dd>
      <dd><span property="v:offerdetails">Offer detail text here</span></dd>
      <dd><span property="v:price" datatype="x:integer">4000</span>
      <span property="v:currency">USD</span></dd>
      <dt>Description:</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:description">Description text here</span></dd>
      <dt>Publisher:</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:publisher">American Food Data Systems Institute</span></dd>
      <dt>Date Created:</dt>
      <dd><span vocab="http://purl.org/dc/terms/" property="dct:issued">2012-01-15</span></dd>
      <dt>Date Modified:</dt>
      <dd><span vocab="http://purl.org/dc/terms/" property="dct:modified">Not modified</span></dd>
      <dt>Type</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:type">Set of images</span></dd>
      <dt>Format</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:format">image/png</span></dd>
      <dt>Identifiier</dt>
      <dd><span vocab="http://purl.org/dc/terms/" property="dct:identifier"><a class="image" href="http://www.afdsi.org/media/images/drupal/front_page/_food_stream_11-12-05_.png" onclick="window.open(this.href); return false">Name of graphic</a></span></dd>
      <dt>Includes Content From:</dt>
      <dd><span vocab="http://purl.org/dc/terms/" property="dct:relation:hasPart"><a class="image" href="http://thenounproject.com/" onclick="window.open(this.href); return false">The Noun Project</a></span></dd>
      <dt>Is Part Of:</dt>
      <dd><span vocab="http://purl.org/dc/terms/" property="dct:relation:isPartOf"><a class="image" href="http://www.afdsi.org/media/images/drupal/front_page/_food_stream_11-12-05_.png" onclick="window.open(this.href); return false">AFDSI front page</a></span></dd>
      <dt>Rights</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:rights"><a class="url" href="http://creativecommons.org/licenses/by-nc-nd/3.0/" onclick="window.open(this.href); return false">Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0)</a></span></dd>
      <dt>Audience</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">News Media</span></dd>
      <dt>Audience</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">Community Groups</span></dd>
      <dt>Audience</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">Parents and Families</span></dd>
      <dt>Audience</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">Policymakers</span></dd>
      <dt>Audience</dt>
      <dd><span vocab="http://purl.org/dc/elements/1.1/" property="dc:audience">Researchers</span></dd>
      <dt>Subject using terms from UN Food and Agriculture Organization Subject Categories (AGRIS)</dt>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:ASC"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">AGRIS-1</a></span></dd>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:ASC"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">AGRIS-2</a></span></dd>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:ASC"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">AGRIS-3</a></span></dd>
      <dt>Subject using terms from US Department of Agriculture National Agricultural Library Thesaurus (NALT)</dt>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:NALT"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">NALT-1</a></span></dd>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:NALT"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">NALT-2</a></span></dd>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:NALT"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">NALT-3</a></span></dd>
      <dt>Subject using terms from US Library of Congress Subject Headings (LCSH)</dt>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:LCSH"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">LCSH-1</a></span></dd>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:LCSH"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">LCSH-2</a></span></dd>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:LCSH"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">LCSH-3</a></span></dd>
      <dt>Subject using terms from US National Library of Medicine Medical Subject Headings (MeSH)</dt>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:MESH"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">MeSH-1</a></span></dd>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:MESH"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">MeSH-2</a></span></dd>
      <dd><span vocab="http://purl.org/agmes/1.1/" property="agmes:MESH"><a class="url" href="http://www.iana.org/domains/example/" onclick="window.open(this.href); return false" rel="tag">MeSH-3</a></span></dd>
      </dl>
      </body>
      </html>
      

      Attachments

        1. doc1.htm
          7 kB
          Szymon Danielczyk

        Activity

          People

            Unassigned Unassigned
            szydan Szymon Danielczyk
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: