Details
-
Bug
-
Status: Closed
-
Minor
-
Resolution: Fixed
-
2.0.21
-
None
Description
This happens when a text in a big font is followed by at least two lines of text in a smaller font: the last word of the first line is merged with the first word of the second line.
On the attached PDF, the extracted text is :
(...) some text awith smaller font (...)
instead of:
(...) some text with a smaller font (...)
I often encounter this kind of problem on invoices, where the company address (small text at the top right) is next to the company name & logo (big centered text at the top).
Attachments
Attachments
Issue Links
- breaks
-
PDFBOX-5213 PDFTextStripper adds next line symbol after sup values (regression)
- Open
- links to