Issue 22206 - Wrong character encoding in text and also in text frames
Summary: Wrong character encoding in text and also in text frames
Status: CLOSED FIXED
Alias: None
Product: Writer
Classification: Application
Component: code
Version: OOo 1.1
Hardware: PC Linux, all
Importance: P3 Trivial
Target Milestone: ---
Assignee: michael.ruess
QA Contact: issues@sw
URL:
Keywords:
Depends on:
Blocks: 21229
Reported: 2003-11-06 16:33 UTC by pavel
Modified: 2013-08-07 14:41 UTC

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
Sample file (13.00 KB, application/octet-stream)
2003-11-06 16:33 UTC, pavel
Sample file (12.50 KB, application/octet-stream)
2003-11-06 16:33 UTC, pavel
Sample file (16.00 KB, application/octet-stream)
2003-11-06 16:34 UTC, pavel
patch against 680 (4.12 KB, patch)
2003-11-11 11:03 UTC, caolanm

Description pavel 2003-11-06 16:33:05 UTC
Hi,

The attached documents are displayed wrongly in Writer. The Czech letter ř
(r with hook) is shown as ø (o slash), e with hook is shown as e with `, etc.

The Czech version contains a hackish patch
ftp://ftp.linux.cz/pub/localization/OpenOffice.org/devel/1.1.0/build-6/build/Patches/OOo_1.1.0_source-fixes_for_word_6_and_95.diff
that fixes the encoding in normal text, but the encoding in text frames is still
wrong.
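
The symptom looks like bytes from a Central European code page being decoded as Western European. A minimal Python sketch of that effect, assuming the sample files store text as Windows-1250 and the import filter decodes it as Windows-1252 (neither is confirmed in this issue; `misdecode` is a hypothetical helper):

```python
# Assumption: text bytes are Windows-1250 (Central European) but get
# decoded as Windows-1252 (Western European). Not confirmed by the issue.

def misdecode(text, stored="cp1250", assumed="cp1252"):
    """Encode text in the stored code page, then decode it with the wrong one."""
    return text.encode(stored).decode(assumed)

print(misdecode("ř"))  # r with hook (caron)
print(misdecode("ě"))  # e with hook (caron)
```

Under these assumptions ř comes out as ø, which matches the "o slash" above, and ě comes out as a letter with a grave accent (ì).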
Comment 1 pavel 2003-11-06 16:33:30 UTC
Created attachment 11013 [details]
Sample file
Comment 2 pavel 2003-11-06 16:33:59 UTC
Created attachment 11014 [details]
Sample file
Comment 3 pavel 2003-11-06 16:34:27 UTC
Created attachment 11015 [details]
Sample file
Comment 4 h.ilter 2003-11-07 10:31:53 UTC
Reassigned to US
Comment 5 ulf.stroehler 2003-11-07 10:59:27 UTC
At first I thought this was an issue with encoded text, but the
bugdocs are (binary) MS WinWord files.
US->MRU: could you please have a look at what makes some characters look so
strange? I don't think it is a font issue but, as Pavel pointed out, an
encoding issue in the import filter (?).
Please change the target if not feasible for OOo 1.1.1. Thanks.
Comment 6 michael.ruess 2003-11-10 15:39:33 UTC
MRU->CMC: This document is in WW6 format. Maybe it was generated
with a localized version of Word. When I create either a WW6 or
WW8/9/10 format doc with WordXP, OO Writer is able to import it correctly.
Comment 7 caolanm 2003-11-11 11:03:58 UTC
Created attachment 11150 [details]
patch against 680
Comment 8 caolanm 2003-11-11 11:05:29 UTC
This is a bit tentative, but it works for this class of document.
Recreating documents in WW6/7 format with later versions of Word puts
the correct charset into each range of text, so I believe the problem
only occurs if that doesn't happen and the language the document is
written in is not a Western European one.
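
The idea can be sketched as a fallback rule: use the charset stored on a text run when there is one, otherwise derive a charset from the document language instead of assuming Western European. This is a hypothetical illustration in Python, not the code from the attached patch; `LANG_TO_CODEC`, `pick_codec`, and `decode_run` are invented names, and the keys are standard Windows language IDs.

```python
# Hypothetical sketch; not the code from the attached patch.
# Windows language IDs mapped to the Python codec for the matching ANSI code page.
LANG_TO_CODEC = {
    0x0405: "cp1250",  # Czech
    0x0415: "cp1250",  # Polish
    0x040E: "cp1250",  # Hungarian
    0x0419: "cp1251",  # Russian
    0x0409: "cp1252",  # English (US)
}

def pick_codec(run_codec, doc_lang_id):
    """Prefer the charset recorded on the run; else fall back to the document language."""
    if run_codec is not None:
        return run_codec
    # Unknown languages keep the Western European default.
    return LANG_TO_CODEC.get(doc_lang_id, "cp1252")

def decode_run(raw, run_codec, doc_lang_id):
    """Decode the raw bytes of one text run with the chosen codec."""
    return raw.decode(pick_codec(run_codec, doc_lang_id))
```

For example, `decode_run(b"\xf8", None, 0x0405)` gives ř for a Czech document with no per-run charset, while the same byte in a document tagged 0x0409 still decodes as ø.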

Fixed in portlaoisefilterteam16 for 2.0
Comment 9 caolanm 2003-11-12 12:17:38 UTC
reopen to reassign
Comment 10 caolanm 2003-11-12 12:17:56 UTC
cmc->mru: Working in portlaoise16 for 2.0 (build: Wed-Nov-12-12-00)
Comment 11 michael.ruess 2003-11-18 14:24:13 UTC
Looks good with CWS portlaoisefilterteam16.
Comment 12 michael.ruess 2003-11-18 14:25:34 UTC
Verified. Fix will be included in OO 2.0.
Comment 13 michael.ruess 2004-05-05 13:17:37 UTC
Checked fix with OO 2.0 snapshot build 680m36.
Comment 14 pavel 2004-05-07 09:55:58 UTC
Thanks!

I have also verified it in cws_src680_ooo20040509.

Great!!!