Tika / TIKA-2162

"Unknown compression method" on a PowerPoint file

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.13
    • Fix Version/s: 1.15, 2.0.0
    • Component/s: parser
    • Labels: None
    • Environment: Windows 7 x64, JVM 1.8.0_101

    Description

      When parsing the attached PowerPoint file, which opens fine in PowerPoint itself, the Tika parser throws the following exception:

      org.apache.poi.hslf.exceptions.HSLFException: java.util.zip.ZipException: unknown compression method
      at org.apache.poi.hslf.blip.EMF.getData(EMF.java:91)
      at org.apache.tika.parser.microsoft.HSLFExtractor.handleSlideEmbeddedPictures(HSLFExtractor.java:324)
      at org.apache.tika.parser.microsoft.HSLFExtractor.parse(HSLFExtractor.java:193)
      at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:149)
      at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:117)
      Caused by: java.util.zip.ZipException: unknown compression method
      at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:164)
      at java.io.FilterInputStream.read(FilterInputStream.java:107)
      at org.apache.poi.hslf.blip.EMF.getData(EMF.java:85)
      ... 6 more
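
      The trace shows the ZipException surfacing while an InflaterInputStream is read inside EMF.getData. As a rough illustration only, the sketch below shows how the JDK reports bytes that are not a valid zlib stream when they are read through InflaterInputStream. It does not reproduce the POI code path, and the class and helper names (InflateProbe, tryInflate, deflate) are hypothetical. What is actually wrong with the picture data in DECAY.ppt is not established by this report.

      ```java
      import java.io.ByteArrayInputStream;
      import java.io.ByteArrayOutputStream;
      import java.io.IOException;
      import java.io.InputStream;
      import java.io.UncheckedIOException;
      import java.nio.charset.StandardCharsets;
      import java.util.zip.DeflaterOutputStream;
      import java.util.zip.InflaterInputStream;
      import java.util.zip.ZipException;

      public class InflateProbe {

          // Hypothetical helper: returns null if the bytes inflate cleanly,
          // otherwise the message of the ZipException raised while reading.
          static String tryInflate(byte[] data) {
              try (InputStream in = new InflaterInputStream(new ByteArrayInputStream(data))) {
                  byte[] buf = new byte[4096];
                  while (in.read(buf) != -1) {
                      // Discard the output; only success or failure matters here.
                  }
                  return null;
              } catch (ZipException e) {
                  return e.getMessage();
              } catch (IOException e) {
                  // Other I/O failures (for example a truncated stream) are not handled here.
                  throw new UncheckedIOException(e);
              }
          }

          // Compresses bytes into a zlib stream so there is a known-good input to compare against.
          static byte[] deflate(byte[] raw) {
              ByteArrayOutputStream bos = new ByteArrayOutputStream();
              try (DeflaterOutputStream out = new DeflaterOutputStream(bos)) {
                  out.write(raw);
              } catch (IOException e) {
                  throw new UncheckedIOException(e);
              }
              return bos.toByteArray();
          }

          public static void main(String[] args) {
              byte[] good = deflate("picture bytes".getBytes(StandardCharsets.UTF_8));
              byte[] bad = {0, 0, 0, 0}; // deliberately not a zlib stream
              System.out.println("good: " + tryInflate(good));
              System.out.println("bad:  " + tryInflate(bad));
          }
      }
      ```

      A caller that wants to keep extracting text when one embedded picture fails could catch the exception around the picture read and continue with the next one. Whether the fix for this issue took that approach is not stated here.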

      Attachments

        1. DECAY.ppt
          645 kB
          Seva Alekseyev

          People

            Assignee: Unassigned
            Reporter: Seva Alekseyev (sevaa)
            Votes: 0
            Watchers: 2
