Tika / TIKA-1473

Apache Tika is not working for .docx documents


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.5, 1.6
    • Fix Version/s: 1.13, 2.0.0
    • Component/s: parser
    • Labels: None

    Description

      I am using Apache Tika 1.6 to read several document formats.
      It reads PDF files and old-format .doc files correctly, but when I try to read a .docx file it throws the following exception:

      org.apache.tika.exception.TikaException: Failed to close temporary resources
          at org.apache.tika.io.TemporaryResources.dispose(TemporaryResources.java:152)
          at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:127)
          ...

      The temporary resource cannot be closed because the Java process is still using it, most likely through the OOXML parser.
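
      The ordering problem described above can be illustrated with a minimal, standard-library-only sketch. It is not Tika's code and does not reproduce the bug itself. It assumes the failure comes from a handle that is still open on a temporary file when cleanup runs, and it shows the ordering that avoids this: close the reader first, then delete the file. The class name TempResourceSketch and the method readAndDispose are hypothetical names chosen for illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

public class TempResourceSketch {

    /**
     * Hypothetical helper (not part of Tika): reads a temporary file to the
     * end, closes the stream, and only then deletes the file.
     * Returns the number of bytes read.
     */
    static int readAndDispose(Path temp) throws IOException {
        int total = 0;
        try {
            // try-with-resources closes the stream when this inner block exits,
            // so no handle is left open by the time the finally block runs.
            try (InputStream in = Files.newInputStream(temp)) {
                byte[] buffer = new byte[4096];
                int n;
                while ((n = in.read(buffer)) != -1) {
                    total += n;
                }
            }
        } finally {
            // Cleanup runs after the stream is closed, even if reading failed.
            Files.deleteIfExists(temp);
        }
        return total;
    }

    public static void main(String[] args) throws IOException {
        Path temp = Files.createTempFile("sketch-", ".tmp");
        Files.write(temp, "hello docx".getBytes(StandardCharsets.UTF_8));

        int bytesRead = readAndDispose(temp);
        System.out.println("bytes read: " + bytesRead);
        System.out.println("temp file still exists: " + Files.exists(temp));
    }
}
```

      If a consumer keeps its own handle on the file past the point where cleanup runs, deletion can fail on platforms that lock open files, which would be consistent with the exception reported here.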


            People

              Assignee: Unassigned
              Reporter: Franco Catto (francocatto)
              Votes: 0
              Watchers: 9

              Dates

                Created:
                Updated:
                Resolved: