Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-646

tika command line can't extract metadata for OOXML files

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.10
    • None
    • None

    Description

      Tika CLI application displays metadata on endDocument() event. Some parsers (OOXML for example) fills metadata after text extraction (after endDocument), that data is missed in output.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              maxim.valyanskiy Maxim Valyanskiy
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: