Apache OpenOffice (AOO) Bugzilla – Issue 22206
Wrong character encoding in text and also in text frames
Last modified: 2013-08-07 14:41:36 UTC
Hi, the attached documents are displayed wrongly in Writer. Czech letter r with hook is shown as o slash, e with hook is shown as e with ` etc. Czech version contains hackish patch ftp://ftp.linux.cz/pub/localization/OpenOffice.org/devel/1.1.0/build-6/build/Patches/OOo_1.1.0_source-fixes_for_word_6_and_95.diff that fixes the encoding in normal text, but the encoding in text frames is still wrong.
Created attachment 11013 [details] Sample file
Created attachment 11014 [details] Sample file
Created attachment 11015 [details] Sample file
Reassigned to US
First thought this was an issue dealing with encoded text. But the bugdocs are (binary) MS winword files. US->MRU: could you pls. have a look what makes some characters look so strange. I don't think it is a font issue but as Pavel pointed out an encoding issue in the import filter (?). Pls. change target if not feasible for OOo 1.1.1. Thx.
MRU->CMC: This document is WW6 format. Maybe this has been geberated with a localized version of Word. when I create either a WW6 or WW8/9/10 format doc with WordXP, OO Writer is able to import it correctly.
Created attachment 11150 [details] patch against 680
This is a bit tentative, but works for this class of document, recreating documents in ww6/7 format with later versions of word puts the correct charset into each range of text, so it only occurs if that doesn't happen and the language the document is written in is not a western european I believe. Fixed in portlaoisefilterteam16 for 2.0
reopen to reassign
cmc->mru: Working in portlaoise16 for 2.0 (build: Wed-Nov-12-12-00)
Looks good with CWS portlaoisefilterteam16.
Verified. Fix will be included in OO 2.0.
Checked fix with OO 2.0 snapshot build 680m36.
Thanks! I have also verified it in cws_src680_ooo20040509. Great!!!