Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.10
-
None
-
None
Description
[ruby] I see strange behaviour, a difference between tika runs via console or as a server. The xml output called via the console contains one line more meta (<meta name="Content-Length" content="26845"/> ), but from the cmd-run the complete file is 337 lines long whereas the output from the server-version is 444 lines long.
The input file is here: http://ratsinfo.dresden.de/getfile.php?id=12174&type=do
console cmd line:
$ java -server -Djava.awt.headless=true -jar 'tika-app.jar' --xml '~/data/0012174.pdf'
the request against the server:
curl -T '~/data/0012174.pdf' http://localhost:9998/tika --header "Accept: text/xml"
Is here is a possibility to upload files, please give me a hint, otherwise i put the two files on my server for a while.