Tika / TIKA-2172

Cannot read Arabic file

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 1.12
    • Fix Version/s: 1.13
    • Component/s: languageidentifier, parser
    • Environment: Windows, using Python

    Description

      When I try to extract text from the attached file (see the External Issue URL field), the extracted text is garbage.
      How can I define the fonts and glyphs manually for a PDF file?
      Thanks.
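One way to make a report like this more concrete is to measure how much of the extracted text is actually Arabic script. The sketch below is only a heuristic under stated assumptions: it uses the Python standard library alone, it does not call Tika, and the names `arabic_ratio` and `looks_garbled` as well as the 0.5 threshold are hypothetical choices for illustration. A low ratio on a document known to be Arabic may suggest that the PDF's fonts lack a usable character mapping, though it does not prove it.

```python
import unicodedata


def arabic_ratio(text: str) -> float:
    """Return the fraction of alphabetic characters that are Arabic script."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    # Unicode names of Arabic letters (including presentation forms)
    # contain the word "ARABIC"; unnamed code points fall back to "".
    arabic = sum(1 for ch in letters if "ARABIC" in unicodedata.name(ch, ""))
    return arabic / len(letters)


def looks_garbled(text: str, threshold: float = 0.5) -> bool:
    """Heuristic check for text that was expected to be Arabic.

    Flags the text if it contains U+FFFD replacement characters or if
    fewer than `threshold` of its letters are Arabic (assumed cutoff).
    """
    return "\ufffd" in text or arabic_ratio(text) < threshold
```

Passing the string returned by the extraction step to `looks_garbled` gives a quick yes/no signal that can be quoted in the report alongside a sample of the output.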

          People

            Assignee: Unassigned
            Reporter: Ahmad Sawalhah (ahmad.sawal)
            Votes: 0
            Watchers: 2
