Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1476

Allow TesseractOCRParser to be configured using an external configuration file

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Implemented
    • Affects Version/s: None
    • Fix Version/s: 1.7
    • Component/s: parser
    • Labels:
      None

      Description

      The TesseractOCRParser is great but configuration at the moment requires configuring up a TesseractOCRConfig instance and placing it on the ParseContext. For those who are not using Tika programmatically in their code, such as users of the Tika Server, this is difficult and requires code changes. It would be more helpful is this could also be configured using a properties file on the classpath.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                davemeikle Dave Meikle
                Reporter:
                davemeikle Dave Meikle
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: