I've tweaked the TesseractOCRParser and TesseractOCRConfig to add "txt" and "hocr" parameters that let you choose a specific output format. Tesseract also has a "pdf" output and, in its next version, a "tsv" output, but I didn't add support for those.
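To illustrate the idea, here is a minimal sketch of how a chosen output type might end up on the tesseract command line. It is not the code from the patch: the class name, enum, and method are hypothetical, and it assumes the output format is passed as a trailing config name ("txt" or "hocr") after the output base.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical sketch, not the actual TesseractOCRParser/TesseractOCRConfig code.
final class OcrOutputTypeSketch {

    // Only the two formats mentioned above; "pdf" and "tsv" are deliberately left out.
    enum OutputType { TXT, HOCR }

    // Builds the argument list for a tesseract call, assuming the output
    // format is given as a trailing config name after the output base.
    static List<String> buildCommand(String imagePath, String outputBase, OutputType type) {
        List<String> cmd = new ArrayList<>();
        cmd.add("tesseract");
        cmd.add(imagePath);
        cmd.add(outputBase);
        cmd.add(type.name().toLowerCase(Locale.ROOT));
        return cmd;
    }

    public static void main(String[] args) {
        // Prints the arguments that would be handed to a process launcher.
        System.out.println(buildCommand("page.png", "page", OutputType.HOCR));
    }
}
```

Keeping the formats in an enum means an unsupported value such as "pdf" cannot be requested by accident until support for it is added.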