Tika
  1. Tika
  2. TIKA-792

NoSuchMethodException "CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)" processing a OOXML document

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 1.0
    • Fix Version/s: 1.2
    • Component/s: parser
    • Labels:
      None
    • Environment:

      Linux, JDK 1.6, Jetty 8.x, Tomcat 6.x

      Description

      Parsing some OOXML documents, this stacktrace is logged many times:

      java.lang.NoSuchMethodException: org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTMarkupImpl.<init>(org.apache.xmlbeans.SchemaType, boolean)
      at java.lang.Class.getConstructor0(Class.java:2723)
      at java.lang.Class.getDeclaredConstructor(Class.java:2002)
      at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.getJavaImplConstructor2(SchemaTypeImpl.java:1749)
      at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createUnattachedSubclass(SchemaTypeImpl.java:1886)
      at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createUnattachedNode(SchemaTypeImpl.java:1875)
      at org.apache.xmlbeans.impl.schema.SchemaTypeImpl.createElementType(SchemaTypeImpl.java:1021)
      at org.apache.xmlbeans.impl.values.XmlObjectBase.create_element_user(XmlObjectBase.java:893)
      at org.apache.xmlbeans.impl.store.Xobj.getUser(Xobj.java:1657)
      at org.apache.xmlbeans.impl.store.Cur.getUser(Cur.java:2654)
      at org.apache.xmlbeans.impl.store.Cur.getObject(Cur.java:2647)
      at org.apache.xmlbeans.impl.store.Cursor._getObject(Cursor.java:995)
      at org.apache.xmlbeans.impl.store.Cursor.getObject(Cursor.java:2904)
      at org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(XWPFParagraph.java:83)
      at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:145)
      at org.apache.poi.POIXMLDocument.load(POIXMLDocument.java:159)
      at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:115)
      at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:53)
      at org.apache.poi.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:180)
      at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:63)
      at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:69)
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
      at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)

      Looking at the poi code java is right here, there is no constructor with a SchemaType and a boolean, only with SchemaType.
      My guess is this one was missed during upgrade to poi beta4, but only a guess, anyway needs a fix .

      1. test10.docx
        17 kB
        Tim Allison

        Activity

        Torsten Krah created issue -
        Nick Burch made changes -
        Field Original Value New Value
        Status Open [ 1 ] Resolved [ 5 ]
        Fix Version/s 1.2 [ 12320169 ]
        Resolution Fixed [ 1 ]
        Tim Allison made changes -
        Attachment test10.docx [ 12596081 ]
        Tim Allison made changes -
        Resolution Fixed [ 1 ]
        Status Resolved [ 5 ] Reopened [ 4 ]
        Tim Allison made changes -
        Status Reopened [ 4 ] Resolved [ 5 ]
        Resolution Fixed [ 1 ]
        Tim Allison made changes -
        Status Resolved [ 5 ] Closed [ 6 ]

          People

          • Assignee:
            Unassigned
            Reporter:
            Torsten Krah
          • Votes:
            1 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development