Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3683

Documentation of native dependencies per module

    XMLWordPrintableJSON

Details

    • Wish
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • tika-docker, tika-server
    • None

    Description

      I created a custom Docker image using the latest Tesseract release. I came across the tika Dockerfile file which installs the following dependencies:

      xfonts-utils
      fonts-freefont-ttf
      fonts-liberation
      ttf-mscorefonts-installer
      cabextract

      I have not found any documetation yet about those dependencies in https://cwiki.apache.org/confluence/display/tika and https://github.com/apache/tika. I can only guess that those dependencies might impact PDF content handling.

      Attachments

        Activity

          People

            Unassigned Unassigned
            dataminer.accolade dataminer.accolade
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: