Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1189

Fails to parse PPT file

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.6
    • cli, gui
    • None
    • OSX 10.9, OSX 10.6

    Description

      Out of the box tika application when presented with the file results in

      Apache Tika was unable to parse the document
      at /Volumes/FREECOM_HDD/Test/CDT_Data_Retention-PPT.ppt.

      The full exception stack trace is included below:

      org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@224f9db
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244)
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
      at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
      at org.apache.tika.gui.TikaGUI.handleStream(TikaGUI.java:320)
      at org.apache.tika.gui.TikaGUI.openFile(TikaGUI.java:279)
      at org.apache.tika.gui.ParsingTransferHandler.importFiles(ParsingTransferHandler.java:94)
      at org.apache.tika.gui.ParsingTransferHandler.importData(ParsingTransferHandler.java:77)
      at javax.swing.TransferHandler.importData(TransferHandler.java:826)
      at javax.swing.TransferHandler$DropHandler.drop(TransferHandler.java:1536)
      at java.awt.dnd.DropTarget.drop(DropTarget.java:450)
      at javax.swing.TransferHandler$SwingDropTarget.drop(TransferHandler.java:1274)
      at sun.awt.dnd.SunDropTargetContextPeer.processDropMessage(SunDropTargetContextPeer.java:537)
      at sun.lwawt.macosx.CDropTargetContextPeer.processDropMessage(CDropTargetContextPeer.java:127)
      at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchDropEvent(SunDropTargetContextPeer.java:851)
      at sun.awt.dnd.SunDropTargetContextPeer$EventDispatcher.dispatchEvent(SunDropTargetContextPeer.java:775)
      at sun.awt.dnd.SunDropTargetEvent.dispatch(SunDropTargetEvent.java:48)
      at java.awt.Component.dispatchEventImpl(Component.java:4716)
      at java.awt.Container.dispatchEventImpl(Container.java:2287)
      at java.awt.Component.dispatchEvent(Component.java:4687)
      at java.awt.LightweightDispatcher.retargetMouseEvent(Container.java:4832)
      at java.awt.LightweightDispatcher.processDropTargetEvent(Container.java:4566)
      at java.awt.LightweightDispatcher.dispatchEvent(Container.java:4417)
      at java.awt.Container.dispatchEventImpl(Container.java:2273)
      at java.awt.Window.dispatchEventImpl(Window.java:2719)
      at java.awt.Component.dispatchEvent(Component.java:4687)
      at java.awt.EventQueue.dispatchEventImpl(EventQueue.java:735)
      at java.awt.EventQueue.access$200(EventQueue.java:103)
      at java.awt.EventQueue$3.run(EventQueue.java:694)
      at java.awt.EventQueue$3.run(EventQueue.java:692)
      at java.security.AccessController.doPrivileged(Native Method)
      at java.security.ProtectionDomain$1.doIntersectionPrivilege(ProtectionDomain.java:76)
      at java.security.ProtectionDomain$1.doIntersectionPrivilege(ProtectionDomain.java:87)
      at java.awt.EventQueue$4.run(EventQueue.java:708)
      at java.awt.EventQueue$4.run(EventQueue.java:706)
      at java.security.AccessController.doPrivileged(Native Method)
      at java.security.ProtectionDomain$1.doIntersectionPrivilege(ProtectionDomain.java:76)
      at java.awt.EventQueue.dispatchEvent(EventQueue.java:705)
      at java.awt.EventDispatchThread.pumpOneEventForFilters(EventDispatchThread.java:242)
      at java.awt.EventDispatchThread.pumpEventsForFilter(EventDispatchThread.java:161)
      at java.awt.EventDispatchThread.pumpEventsForHierarchy(EventDispatchThread.java:150)
      at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:146)
      at java.awt.EventDispatchThread.pumpEvents(EventDispatchThread.java:138)
      at java.awt.EventDispatchThread.run(EventDispatchThread.java:91)
      Caused by: java.lang.RuntimeException: Couldn't instantiate the class for type with id 5000 on class class org.apache.poi.hslf.record.DummyPositionSensitiveRecordWithChildren : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.RuntimeException: Couldn't instantiate the class for type with id 5002 on class class org.apache.poi.hslf.record.DummyPositionSensitiveRecordWithChildren : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.RuntimeException: Couldn't instantiate the class for type with id 5003 on class class org.apache.poi.hslf.record.BinaryTagDataBlob : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.RuntimeException: Couldn't instantiate the class for type with id 4012 on class class org.apache.poi.hslf.record.StyleTextProp9Atom : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.ArrayIndexOutOfBoundsException: 56
      at org.apache.poi.hslf.record.Record.createRecordForType(Record.java:185)
      at org.apache.poi.hslf.record.Record.findChildRecords(Record.java:128)
      at org.apache.poi.hslf.model.SimpleShape.getClientRecords(SimpleShape.java:347)
      at org.apache.poi.hslf.model.SimpleShape.getClientDataRecord(SimpleShape.java:319)
      at org.apache.poi.hslf.model.TextShape.getPlaceholderAtom(TextShape.java:591)
      at org.apache.poi.hslf.model.Sheet.getPlaceholder(Sheet.java:438)
      at org.apache.poi.hslf.model.HeadersFooters.isVisible(HeadersFooters.java:244)
      at org.apache.poi.hslf.model.HeadersFooters.isHeaderVisible(HeadersFooters.java:148)
      at org.apache.tika.parser.microsoft.HSLFExtractor.parse(HSLFExtractor.java:62)
      at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:189)
      at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:161)
      at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
      ... 42 more
      Caused by: java.lang.reflect.InvocationTargetException
      at sun.reflect.GeneratedConstructorAccessor12.newInstance(Unknown Source)
      at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
      at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
      at org.apache.poi.hslf.record.Record.createRecordForType(Record.java:181)
      ... 53 more
      Caused by: java.lang.RuntimeException: Couldn't instantiate the class for type with id 5002 on class class org.apache.poi.hslf.record.DummyPositionSensitiveRecordWithChildren : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.RuntimeException: Couldn't instantiate the class for type with id 5003 on class class org.apache.poi.hslf.record.BinaryTagDataBlob : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.RuntimeException: Couldn't instantiate the class for type with id 4012 on class class org.apache.poi.hslf.record.StyleTextProp9Atom : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.ArrayIndexOutOfBoundsException: 56
      at org.apache.poi.hslf.record.Record.createRecordForType(Record.java:185)
      at org.apache.poi.hslf.record.Record.findChildRecords(Record.java:128)
      at org.apache.poi.hslf.record.DummyPositionSensitiveRecordWithChildren.<init>(DummyPositionSensitiveRecordWithChildren.java:52)
      ... 57 more
      Caused by: java.lang.reflect.InvocationTargetException
      at sun.reflect.GeneratedConstructorAccessor12.newInstance(Unknown Source)
      at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
      at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
      at org.apache.poi.hslf.record.Record.createRecordForType(Record.java:181)
      ... 59 more
      Caused by: java.lang.RuntimeException: Couldn't instantiate the class for type with id 5003 on class class org.apache.poi.hslf.record.BinaryTagDataBlob : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.RuntimeException: Couldn't instantiate the class for type with id 4012 on class class org.apache.poi.hslf.record.StyleTextProp9Atom : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.ArrayIndexOutOfBoundsException: 56
      at org.apache.poi.hslf.record.Record.createRecordForType(Record.java:185)
      at org.apache.poi.hslf.record.Record.findChildRecords(Record.java:128)
      at org.apache.poi.hslf.record.DummyPositionSensitiveRecordWithChildren.<init>(DummyPositionSensitiveRecordWithChildren.java:52)
      ... 63 more
      Caused by: java.lang.reflect.InvocationTargetException
      at sun.reflect.GeneratedConstructorAccessor11.newInstance(Unknown Source)
      at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
      at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
      at org.apache.poi.hslf.record.Record.createRecordForType(Record.java:181)
      ... 65 more
      Caused by: java.lang.RuntimeException: Couldn't instantiate the class for type with id 4012 on class class org.apache.poi.hslf.record.StyleTextProp9Atom : java.lang.reflect.InvocationTargetException
      Cause was : java.lang.ArrayIndexOutOfBoundsException: 56
      at org.apache.poi.hslf.record.Record.createRecordForType(Record.java:185)
      at org.apache.poi.hslf.record.Record.findChildRecords(Record.java:128)
      at org.apache.poi.hslf.record.BinaryTagDataBlob.<init>(BinaryTagDataBlob.java:52)
      ... 69 more
      Caused by: java.lang.reflect.InvocationTargetException
      at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
      at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57)
      at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
      at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
      at org.apache.poi.hslf.record.Record.createRecordForType(Record.java:181)
      ... 71 more
      Caused by: java.lang.ArrayIndexOutOfBoundsException: 56
      at org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
      at org.apache.poi.hslf.record.StyleTextProp9Atom.<init>(StyleTextProp9Atom.java:70)
      ... 76 more

      Attachments

        1. CDT_Data_Retention-PPT.ppt
          2.51 MB
          Aimee Dev

        Activity

          People

            Unassigned Unassigned
            aimdev Aimee Dev
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: