Details
-
Bug
-
Status: Closed
-
Minor
-
Resolution: Duplicate
-
None
-
None
-
None
Description
I have an example PDF with 90 degree rotation; Tika produces the
characters one line at a time. Ie, the doc has "Some rotated text,
here!" but Tika produces this:
<body><div class="page"><p>So m e r o t a t e d t e x t , h e r e !</p>
I'm able to copy/paste the text out correctly.
Attachments
Attachments
Issue Links
- duplicates
-
TIKA-2779 Integrate/parameterize new rotated text handling in PDFBox
- Resolved