Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Not A Bug
-
1.21
-
None
-
None
-
Tested on DEV env
Description
I am using tika-core 1.21 and tika-parsers 1.21 jar files as tika dependencies in Manifoldcf 2.14 version to crawl some files, Out of which some of the PDF's files are not getting parsed correctly.
Getting some issues while parsing PDF files. Some strange characters appeared, tried changing Tika jar files version also 1.24 and 1.27 (for 1.27-it didn't even extract files correctly).
Also checked with the document content, it seems to be fine.
Can anybody help me on this.
Image attached for reference of strange characters.
Tried version changing , but didn't help