Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3738

ForkParser missing metadata for some document formats

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 2.3.0
    • None
    • parser
    • None
    • Java 11.0.14.

    Description

      When using ForkParser, metadata from some parsers is not being returned in the Metadata object or in the head of the returned XML. These include OpenDocument Presentation (ODP), OpenDocument Spreadsheet (ODS), Microsoft Word 2006 XML, MP4 Audio (M4A) and MP4 Video (MP4).

      Patch for ForkParserIntegrationTest showing the issue for these file types is attached, along with an MP4 video file containing metadata as there doesn't appear to be one currently in the test set.

      Attachments

        1. testVideoMetadataMp4.mp4
          1.01 MB
          Stephen H
        2. ForkParserIntegrationTest.java.diff
          5 kB
          Stephen H

        Activity

          People

            Unassigned Unassigned
            steveaitch Stephen H
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: