Details
Type: New Feature
Status: Closed
Priority: Minor
Resolution: Fixed
Description
It would be nice to have a Tika-based command-line application that takes a document (via standard input, or as a filename or URL argument) and outputs the extracted metadata and text content (as either XHTML or plain text).
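To make the three input modes concrete, here is a minimal sketch of how such a tool might resolve its single argument into a stream. It uses only the Java standard library and deliberately leaves out the Tika parsing step. The class name `DocumentInput`, the `open` and `looksLikeUrl` helpers, and the convention that a missing argument or `-` means standard input are all assumptions made for illustration; they are not taken from this issue or from Tika itself.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper: resolves a command-line argument to an input stream.
final class DocumentInput {

    // Treats anything of the form "scheme://..." as a URL (an assumed heuristic).
    static boolean looksLikeUrl(String arg) {
        return arg.matches("[a-zA-Z][a-zA-Z0-9+.-]*://.*");
    }

    // No argument (or "-") reads standard input; otherwise a URL or a file path.
    static InputStream open(String arg, InputStream stdin) throws IOException {
        if (arg == null || arg.equals("-")) {
            return stdin;
        }
        if (looksLikeUrl(arg)) {
            return URI.create(arg).toURL().openStream();
        }
        return Files.newInputStream(Paths.get(arg));
    }

    public static void main(String[] args) throws IOException {
        String source = args.length > 0 ? args[0] : null;
        try (InputStream in = open(source, System.in)) {
            // A real tool would hand `in` to a Tika parser at this point;
            // this sketch only reports how many bytes were available.
            System.out.println("Read " + in.readAllBytes().length + " bytes");
        }
    }
}
```

Keeping the input resolution separate from the extraction step means the same parsing code can serve all three sources, and the choice between XHTML and plain-text output can be made independently of where the document came from.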