Resolution: Not A Bug
Affects Version/s: 2.0.7
Fix Version/s: None
Component/s: Text extraction
Environment: Windows 7 (64 bit)
Hello Support Team,
I am working on a task where I have to extract formulas from a PDF document and convert them into images.
But when I extract them using PDFBox, some of the symbols, such as the summation sign, the integral sign, or large parentheses, get mixed into the previous line of text.
I checked the output of the DrawPrintTextLocations example with that particular PDF document, and the result does not look normal.
The red boxes are not aligned properly in the output, as you can see in the attached files.
I am attaching the output of two pages and the PDF document itself.
Please refer to page 34 or 37 for this issue.
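In case it helps to narrow things down, here is a rough sketch of what I suspect is happening. It is plain Java with no PDFBox calls, and the class name, method names, and coordinates are all made up for illustration (they are not taken from the attached PDF). The idea is that if the box of a tall symbol reaches up into the vertical band of the previous line, a simple line grouping based on box overlap would attach the symbol to that line.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class LineOverlapCheck {

    /** Vertical extent of one glyph box; y grows downward, as in a rendered page image. */
    static final class Box {
        final String label;
        final double top;
        final double bottom;

        Box(String label, double top, double bottom) {
            this.label = label;
            this.top = top;
            this.bottom = bottom;
        }
    }

    /** True if the box's vertical extent intersects the band between lineTop and lineBottom. */
    static boolean overlapsBand(Box box, double lineTop, double lineBottom) {
        return box.top < lineBottom && box.bottom > lineTop;
    }

    /** Labels of all glyphs whose boxes reach into the given band. */
    static List<String> glyphsOverlappingBand(List<Box> glyphs, double lineTop, double lineBottom) {
        List<String> result = new ArrayList<>();
        for (Box glyph : glyphs) {
            if (overlapsBand(glyph, lineTop, lineBottom)) {
                result.add(glyph.label);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Hypothetical band of the previous text line.
        double previousLineTop = 100;
        double previousLineBottom = 112;

        // Hypothetical glyphs of the formula line: a normal letter and a tall summation sign.
        List<Box> formulaLine = Arrays.asList(
                new Box("x", 120, 132),
                new Box("summation", 108, 140));

        // Prints the glyphs that a box-overlap grouping would pull into the previous line.
        System.out.println(glyphsOverlappingBand(formulaLine, previousLineTop, previousLineBottom));
    }
}
```

With these made-up numbers only the summation sign is reported, which matches the kind of mixing I see in the extracted text. I do not know whether PDFBox groups lines this way internally; this only shows the symptom.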
Thank you in advance!