Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1086

Tika-bundle 1.3 does not import org.w3c.dom package

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.3
    • 1.5
    • parser
    • None

    Description

      The tika-bundle 1.3 version does not import org.w3c.dom package, as a result it is not able to parse DOM based documents such as Microsoft Word (docx) documents.

      This issue does not have in version 1.2 as it does import the necessary package and therefore the parsing of the documents work fine.

      Can someone please look into the issue, as Microsoft Word is a very popular document.

      Attachments

        1. TIKA-1086.svn.diff
          0.6 kB
          Niels Beekman

        Issue Links

          Activity

            People

              Unassigned Unassigned
              gsehgal Gaurav
              Votes:
              2 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: