I have just finished integrating PDFBox and ZXing to extract barcodes, and I would like to donate the source code to your foundation.
The program does the following:
- extract all scanned images in a PDF,
- apply some homebrew image filters to retrieve areas of interest,
- rotate the cropped areas and send them to ZXing to find any barcode,
- aggregate all results in a dedicated List.
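
The rotate-and-decode and aggregation steps above can be sketched roughly as follows. This is a minimal illustration, not the contributed code: the class name, the method names, and the `Decoder` interface are all hypothetical. `Decoder` stands in for the ZXing call (in a real integration it would likely wrap `MultiFormatReader.decode` on a `BinaryBitmap` built from `BufferedImageLuminanceSource` and `HybridBinarizer`), and the sketch assumes the cropped areas have already been produced by the image filters.

```java
import java.awt.image.BufferedImage;
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

class BarcodeScanSketch {

    /** Hypothetical stand-in for a ZXing reader: decoded text, or empty if nothing was found. */
    interface Decoder {
        Optional<String> decode(BufferedImage image);
    }

    /** Rotates an image 90 degrees clockwise by copying pixels one by one. */
    static BufferedImage rotate90(BufferedImage src) {
        int w = src.getWidth();
        int h = src.getHeight();
        // The rotated image swaps width and height.
        BufferedImage dst = new BufferedImage(h, w, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < h; y++) {
            for (int x = 0; x < w; x++) {
                dst.setRGB(h - 1 - y, x, src.getRGB(x, y));
            }
        }
        return dst;
    }

    /** Tries the area at 0, 90, 180 and 270 degrees and returns the first decoded value. */
    static Optional<String> decodeWithRotations(BufferedImage area, Decoder decoder) {
        BufferedImage current = area;
        for (int turn = 0; turn < 4; turn++) {
            if (turn > 0) {
                current = rotate90(current);
            }
            Optional<String> value = decoder.decode(current);
            if (value.isPresent()) {
                return value;
            }
        }
        return Optional.empty();
    }

    /** Scans the cropped areas of every page and collects all hits in one list. */
    static List<String> scan(List<List<BufferedImage>> areasPerPage, Decoder decoder) {
        List<String> results = new ArrayList<>();
        for (int page = 0; page < areasPerPage.size(); page++) {
            for (BufferedImage area : areasPerPage.get(page)) {
                Optional<String> value = decodeWithRotations(area, decoder);
                if (value.isPresent()) {
                    results.add("page=" + page + ", value=" + value.get());
                }
            }
        }
        return results;
    }
}
```

Keeping the decoder behind a small interface is only a design choice for this sketch: it lets the rotation loop be exercised without the ZXing jar on the classpath.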
I hope it can be useful for PDFBox or Lucene.
pdf scanned in 3803 ms
page=0, barcodeFormat=DATA_MATRIX, value=HP14601225523