Description
Parsing the attached PowerPoint presentation, which opens fine in PowerPoint itself, causes the Tika parser to throw the following exception:
java.lang.ArrayIndexOutOfBoundsException
at java.lang.System.arraycopy(Native Method)
at org.apache.poi.hslf.usermodel.HSLFSlideShowImpl.readPictures(HSLFSlideShowImpl.java:438)
at org.apache.poi.hslf.usermodel.HSLFSlideShowImpl.getPictureData(HSLFSlideShowImpl.java:772)
at org.apache.poi.hslf.usermodel.HSLFSlideShow.getPictureData(HSLFSlideShow.java:547)
at org.apache.tika.parser.microsoft.HSLFExtractor.handleSlideEmbeddedPictures(HSLFExtractor.java:305)
at org.apache.tika.parser.microsoft.HSLFExtractor.parse(HSLFExtractor.java:193)
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:149)
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:117)
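
The trace shows the exception surfacing from System.arraycopy inside readPictures. One plausible cause, which is an assumption and not something this report confirms, is a picture record whose declared length exceeds the bytes actually left in the stream. The following is a minimal stand-alone sketch of that failure mode and of a defensive copy that clamps the length. The class PictureCopySketch and its methods are hypothetical illustrations and are not POI or Tika code.

```java
public class PictureCopySketch {

    // Hypothetical helper: mimics an unchecked copy that trusts the
    // declared record length. Returns true if the copy fails with an
    // index-out-of-bounds error.
    static boolean uncheckedCopyFails(byte[] stream, int offset, int declaredLength) {
        try {
            byte[] out = new byte[declaredLength];
            System.arraycopy(stream, offset, out, 0, declaredLength);
            return false;
        } catch (IndexOutOfBoundsException e) {
            // ArrayIndexOutOfBoundsException is a subclass of this type.
            return true;
        }
    }

    // Hypothetical helper: copies at most the bytes that are really
    // available after the offset, instead of trusting declaredLength.
    static byte[] copyRecord(byte[] stream, int offset, int declaredLength) {
        if (offset < 0 || offset > stream.length || declaredLength < 0) {
            throw new IllegalArgumentException("invalid offset or length");
        }
        int available = stream.length - offset;
        int length = Math.min(declaredLength, available);
        byte[] out = new byte[length];
        System.arraycopy(stream, offset, out, 0, length);
        return out;
    }

    public static void main(String[] args) {
        // Assumed scenario: 16 bytes of data, record claims 32 bytes at offset 8.
        byte[] stream = new byte[16];
        System.out.println("unchecked copy fails: " + uncheckedCopyFails(stream, 8, 32));
        System.out.println("clamped copy length: " + copyRecord(stream, 8, 32).length);
    }
}
```

Whether a truncated picture should be returned or skipped is a design choice for the library; the sketch only shows that validating the length before copying avoids the exception.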
Attachments
Issue Links
- depends upon TIKA-2116 Upgrade to POI 3.16-beta1 when available (Resolved)