Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
0.7
-
None
-
None
-
Windows XP
Description
The parser is not preserving the character encoding when parsing documents in Arabic UTF-8, specifically with .pdf and .doc. The resulting character output is undechipherable or just question-mark symbols.