Details
Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Description
In some use cases, custom parsers need to pass metadata from embedded files up to the parent's metadata, so that it is available in the /tika (json) output or programmatically.
We can follow the pattern in TIKA-3788.
As with TIKA-3788, this metadata will be written after the parse, so it will not show up in standard XHTML output (e.g. /tika (html/xhtml) or programmatically in the XHTMLContentHandler). However, it will appear in the JSON output option from /tika and in the Metadata object programmatically.
As with TIKA-3788, we encourage using the /rmeta endpoint, -J in tika-app or the RecursiveParserWrapper instead of this option. However, for those who need to work with a flattened view of a document, this can be invaluable.
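To make the "flattened view" concrete, here is a minimal stand-alone sketch of the idea. It is not Tika code and does not reflect how the feature is or will be implemented. Plain maps stand in for the Metadata object, the list mimics an /rmeta-style result (parent first, then embedded files), and the class name, the "embedded:" prefix, and all metadata keys are hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Stand-alone model only: plain maps stand in for Tika's Metadata object.
final class EmbeddedMetadataFlattener {

    // Hypothetical prefix marking values copied up from embedded files.
    static final String PREFIX = "embedded:";

    /**
     * docs.get(0) models the parent; the remaining entries model embedded files.
     * Returns a new parent map in which the selected embedded keys have been
     * appended under PREFIX + key. The input maps are not modified.
     */
    static Map<String, List<String>> flatten(List<Map<String, List<String>>> docs,
                                             Set<String> keysToLift) {
        if (docs.isEmpty()) {
            throw new IllegalArgumentException("expected at least the parent metadata");
        }
        // Defensive copy of the parent's own metadata.
        Map<String, List<String>> parent = new LinkedHashMap<>();
        for (Map.Entry<String, List<String>> e : docs.get(0).entrySet()) {
            parent.put(e.getKey(), new ArrayList<>(e.getValue()));
        }
        // Append selected values from each embedded file, in document order.
        for (Map<String, List<String>> embedded : docs.subList(1, docs.size())) {
            for (String key : keysToLift) {
                List<String> values = embedded.get(key);
                if (values != null) {
                    parent.computeIfAbsent(PREFIX + key, k -> new ArrayList<>())
                          .addAll(values);
                }
            }
        }
        return parent;
    }

    public static void main(String[] args) {
        Map<String, List<String>> parent = new LinkedHashMap<>();
        parent.put("Content-Type", List.of("application/zip"));

        Map<String, List<String>> child = new LinkedHashMap<>();
        child.put("my-parser:flag", List.of("has-macros")); // hypothetical key and value

        System.out.println(flatten(List.of(parent, child), Set.of("my-parser:flag")));
    }
}
```

The sketch also shows why the recursive view is usually preferable: once values from several embedded files are merged into one map, it is no longer possible to tell which embedded file each value came from.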