Bug 26012

Summary: Ship pdftotext binary with Lenya or re-evaluate PDFBox
Product: Lenya
Reporter: Michael Wechner <michi>
Component: Build System
Assignee: Lenya Developers <dev>
Status: NEW ---
Severity: enhancement
CC: dev
Priority: P3    
Version: Trunk   
Target Milestone: 2.0.1   
Hardware: Other   
OS: other   
Bug Depends on: 33702    
Bug Blocks:    

Description Michael Wechner 2004-01-09 09:16:10 UTC
Indexing PDF documents requires an external text-extraction program such as
pdftotext or PDFBox. We should ship one of them out of the box.
Comment 1 Gregor J. Rothfuss 2005-03-21 03:39:25 UTC
Nutch ships with a PDF parsing plugin based on PDFBox:

http://svn.apache.org/viewcvs.cgi/incubator/nutch/trunk/src/plugin/parse-pdf/