Details
-
Bug
-
Status: Closed
-
Blocker
-
Resolution: Fixed
-
None
-
None
Description
[imported from SourceForge]
http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1258807
Originally submitted by gohan222 on 2005-08-13 22:47.
The PDFTextStripper misses some spacing in between
words. It crunches sentences together on occasions.
After running extract look for string "demandsofGoogle's".
[attachment on SourceForge]
http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1258807&file_id=145590
p125-ghemawat.zip (application/zip), 252359 bytes
Removes spacing
Attachments
Issue Links
- relates to
-
PDFBOX-349 Spaces between words ignored in scanned pdf files
- Closed