Tika / TIKA-2172

Cannot read Arabic file


    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 1.12
    • Fix Version/s: 1.13
    • Component/s: languageidentifier, parser
    • Labels:
    • Environment:

      Windows, using Python

      Description

      When I try to extract text from the attached PDF file (see the External Issue URL section), the extracted text is garbage.
      How can I define the fonts and glyphs manually for a PDF file?
      Thanks.
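
      The report does not say how the garbled output was detected or what caused it. As a rough illustration only, the sketch below measures what share of the letters in an extracted string belong to the Arabic script, which is one way to flag output that has come back as the wrong characters. The helper names `arabic_ratio` and `extract_with_tika` are hypothetical. `extract_with_tika` assumes the tika-python package and a Java runtime are installed. This check does not repair extraction; it only helps to spot the problem.

```python
import unicodedata


def arabic_ratio(text: str) -> float:
    """Return the fraction of alphabetic characters that are Arabic-script letters."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    # Unicode names of Arabic letters (including presentation forms) start with "ARABIC".
    arabic = sum(1 for ch in letters if unicodedata.name(ch, "").startswith("ARABIC"))
    return arabic / len(letters)


def extract_with_tika(path: str) -> str:
    """Extract text with tika-python (assumption: package and Java are available)."""
    from tika import parser  # imported lazily so arabic_ratio works without Tika

    parsed = parser.from_file(path)
    # "content" can be None when no text was extracted.
    return parsed.get("content") or ""


if __name__ == "__main__":
    sample = "مرحبا"  # stand-in for real extracted text
    print(f"Arabic letter share: {arabic_ratio(sample):.2f}")
```

      A low ratio for a document known to be Arabic suggests that the character codes stored in the PDF do not map back to the intended Unicode text, although the ratio alone cannot tell you why.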

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              ahmad.sawal Ahmad Sawalhah
            • Votes:
              0
              Watchers:
              2

              Dates

              • Created:
                Updated:
                Resolved: