1. PDFBox

Text extraction


Issues: Unresolved

Key Summary Due Date
Bug PDFBOX-2252 PDFTextStripper has problem with bilingual documents
Bug PDFBOX-2749 Annotations character bounding boxes size 3 times higher than expected
Bug PDFBOX-448 Columns in text not extracted separately

View Issues

Issues: Updated recently

Key Summary Updated
Bug PDFBOX-2843 widthOfSpace() appears wrong in TextPosition
Bug PDFBOX-2839 Missing TextPosition(s) in PDFTextStripper
Bug PDFBOX-2831 ArrayIndexOutOfBoundsException in mergeDiacritic() on extraction of text with diacritic text

View Issues

Versions: Unreleased

Name Release date
Unreleased 1.8.10  
Unreleased 2.0.0  
Unreleased 2.1.0  
Unreleased 3.0.0