Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3789

Allow parsers to pass embedded metadata to container file's metadata

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      There are some use cases where custom parsers might want to pass metadata from embedded files to the parent's metadata in the /tika (json) output or programmatically.

      We can follow the pattern in TIKA-3788.

      As with TIKA-3788, this metadata will be written after the parse so it will not show up in standard xhtml output (e.g. /tika (html/xhtml) or programmatically in the XHTMLContentHandler). However, it will appear in the json output option from /tika and in the Metadata object programmatically.

      As with TIKA-3788, we encourage using the /rmeta endpoint, -J in tika-app or the RecursiveParserWrapper instead of this option. However, for those who need to work with a flattened view of a document, this can be invaluable.

      Attachments

        Activity

          People

            Unassigned Unassigned
            tallison Tim Allison
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: