Details
Type: Bug
Status: Closed
Priority: Minor
Resolution: Not A Bug
Affects Version: 1.26
None
None
Description
When a PDF document is sent to the local Tika server with a PUT request to the endpoint http://localhost:9998/tika, setting PDFOcrStrategy to anything other than AUTO or NO_OCR causes an extreme slowdown in processing of the PDF file.
It doesn't matter whether the PDF document has inline images or not; the slowdown happens regardless.
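A minimal sketch for reproducing the timing difference, assuming the strategy is passed per request through the `X-Tika-PDFOcrStrategy` header and that the server accepts lowercase strategy names. Both are assumptions not stated in the report, as are the helper names `build_request` and `time_strategy`; the PDF path is supplied on the command line.

```python
import sys
import time
import urllib.request

TIKA_URL = "http://localhost:9998/tika"  # endpoint from the report


def build_request(pdf_bytes, ocr_strategy, url=TIKA_URL):
    """Build a PUT request carrying the PDF and the chosen OCR strategy."""
    headers = {
        "Content-Type": "application/pdf",
        "Accept": "text/plain",
        # Assumed header name for selecting the strategy per request.
        "X-Tika-PDFOcrStrategy": ocr_strategy,
    }
    return urllib.request.Request(url, data=pdf_bytes, headers=headers, method="PUT")


def time_strategy(pdf_path, ocr_strategy):
    """Send the PDF once and return the elapsed wall-clock seconds."""
    with open(pdf_path, "rb") as f:
        pdf_bytes = f.read()
    req = build_request(pdf_bytes, ocr_strategy)
    start = time.perf_counter()
    with urllib.request.urlopen(req) as resp:
        resp.read()  # drain the extracted text so the full request is timed
    return time.perf_counter() - start


if __name__ == "__main__" and len(sys.argv) > 1:
    # Assumed strategy spellings; adjust to what the server accepts.
    for strategy in ("no_ocr", "auto", "ocr_only", "ocr_and_text_extraction"):
        print(f"{strategy}: {time_strategy(sys.argv[1], strategy):.2f}s")
```

Running this against the same file, with and without inline images, should show whether the slowdown tracks the strategy alone.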