Issue 19313 - Scandinavian characters (æøå) broken in some situations.
Summary: Scandinavian characters (æøå) broken in some situations.
Status: CLOSED FIXED
Alias: None
Product: Writer
Classification: Application
Component: code
Version: OOo 1.1 RC4
Hardware: PC Windows XP
Importance: P3 Trivial with 2 votes
Target Milestone: ---
Assignee: henrikc
QA Contact: Unknown
URL:
Keywords: oooqa
Depends on:
Blocks:
 
Reported: 2003-09-09 11:45 UTC by henrikc
Modified: 2013-08-07 14:43 UTC
CC List: 3 users

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
Sample text with repaired and broken 'æøå' chars. ä and ö are fine. (2.68 KB, application/octet-stream)
2003-09-09 11:46 UTC, henrikc
Word document showing broken chars (20.50 KB, application/octet-stream)
2003-09-09 13:50 UTC, henrikc
Word document showing broken chars (20.50 KB, application/octet-stream)
2003-09-09 13:51 UTC, henrikc
Word document showing broken chars (20.50 KB, application/octet-stream)
2003-09-09 13:55 UTC, henrikc
Here's an OOo version of the file. (6.79 KB, application/octet-stream)
2003-09-09 14:05 UTC, henrikc
MS Word 2000 file. Problem reproduced in OOo 1.1.0 rc8 Danish. (40.00 KB, application/octet-stream)
2003-09-11 12:24 UTC, henrikc
Another Word2000 file with the problem. OOo 1.1.0 rc8, Danish (20.00 KB, application/octet-stream)
2003-09-11 12:30 UTC, henrikc

Description henrikc 2003-09-09 11:45:25 UTC
Using the latest OOo builds, I noticed that the special Scandinavian characters 
Æ, Ø, Å, æ, ø, å are broken on import from Word documents. Also, when copying 
text from OOo to other applications (a generic Windows RTF control), the letters 
are broken in the same pattern.




Sample text below: the correct text comes first; after the second 'OBS:' comes a 
sample of the broken text. I'll leave it to others to assign a priority to this. 




The problem is consistent: it happens at every import and cut&paste operation.




Sample text:






OBS:


Installasjonen består av to deler. Først må du installere selve skriverdriveren. 
Det gjør du ved å klikke p? knappen Installer på denne siden. Deretter klikker 
du p? knappen Ekstra filer for å få installert de underliggende rutinene. 




Du skal ikke velge kommandoen Skriv ut til fil når du bruker pdf995, men bare 
klikke på Skriv ut som om du skrev ut på en vanlig papirskriver.






OBS:


Installasjonen best?r av to deler. F?rst m? du installere selve skriverdriveren. 
Det gj?r du ved ? klikke p? knappen Installer p? denne siden. Deretter klikker 
du p? knappen Ekstra filer for ? f? installert de underliggende rutinene. 




Du skal ikke velge kommandoen Skriv ut til fil n?r du bruker pdf995, men bare 
klikke p? Skriv ut som om du skrev ut p? en vanlig papirskriver.
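
The pattern in the two samples can be checked mechanically. What follows is a minimal Python sketch, not part of the original report: the function name `replaced_chars` is hypothetical, and it assumes that each damaged character was replaced by exactly one '?', as in the sample above.

```python
def replaced_chars(good, broken):
    """Return the set of characters in `good` that appear as '?' in `broken`."""
    # Assumption: one '?' per damaged character, so both strings line up.
    if len(good) != len(broken):
        raise ValueError("samples must have the same length")
    return {g for g, b in zip(good, broken) if b == "?" and g != "?"}


# First sentences of the sample, correct and broken versions.
good = "Installasjonen består av to deler. Først må du installere selve skriverdriveren."
broken = "Installasjonen best?r av to deler. F?rst m? du installere selve skriverdriveren."

print(sorted(replaced_chars(good, broken)))
```

Running the same comparison over a whole document pair would show at a glance whether only æ, ø and å (and their capitals) are affected.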
Comment 1 henrikc 2003-09-09 11:46:50 UTC
Created attachment 9122 [details]
Sample text with repaired and broken 'æøå' chars. ä and ö are fine.
Comment 2 henrikc 2003-09-09 11:48:52 UTC
ä and ö, which are used in Swedish and Finnish, are fine, 
interestingly.
Comment 3 jack.warchold 2003-09-09 12:51:11 UTC
jw: reassigned to jw
Comment 4 jack.warchold 2003-09-09 12:57:43 UTC
Hello henrikc, can you attach the Word document, please?
In OOo 1.1 RC4 the RTF looks the same as it does in MS WordPad. 

Copying and pasting it from any other application into OOo does not change 
the Swedish letters in any way.
Comment 5 henrikc 2003-09-09 13:50:47 UTC
Created attachment 9124 [details]
Word document showing broken chars
Comment 6 henrikc 2003-09-09 13:51:01 UTC
Created attachment 9125 [details]
Word document showing broken chars
Comment 7 henrikc 2003-09-09 13:55:30 UTC
Created attachment 9126 [details]
Word document showing broken chars
Comment 8 henrikc 2003-09-09 14:05:29 UTC
Created attachment 9127 [details]
Here's an OOo version of the file.
Comment 9 henrikc 2003-09-09 14:07:45 UTC
Sorry for the triple .doc file - site was slow in responding.


There's a good reason the original .RTF file looks like a WordPad file 
- WordPad uses the same RichText control that I created it with in my 
own application.


BTW, how do I ascertain exactly which OOo I'm currently running? I've 
seen the bug in rc4, but am not sure if I'm running rc3 or rc4 here.
Comment 10 henrikc 2003-09-09 14:13:51 UTC
Hm, to clarify: The problem occurs when pasting from OOo to other 
apps. 


And now I had a case when it works. Damn, thought it was systematic.


Comment 11 henrikc 2003-09-09 14:17:05 UTC
Possibly related behaviour:


When I paste from OOo to my RichEdit control, there are sometimes 
characters dropped from the text when format changes. For instance, if 
I have a word in Italics in the text, the space between the italicised 
word and the next gets lost in the paste. 


In the reverse direction, pasting from RichEdit app to OOo, I notice 
that line breaks frequently turn into page breaks.
Comment 12 henrikc 2003-09-09 15:30:04 UTC
FYI, I'm running a clipcache application that preserves every single 
cut&paste operation I've done for months. Feel free to request more 
examples.
Comment 13 lohmaier 2003-09-09 18:29:53 UTC
for the record: not reproducible (on linux)
Comment 14 jack.warchold 2003-09-10 10:32:51 UTC
Sorry henrikc, I cannot reproduce this either. 
Can you tell me the name of the clipcache program? Maybe it only 
occurs in combination with this program?
Comment 15 henrikc 2003-09-10 11:07:14 UTC
The ClipCache (that's its name) will just do an after-the-copy paste 
into itself, probably not significant.


Instead, I'll get my hands on some Word documents that have the 
identical problem - my neighbour has a drive full of these. 


I'll be back :)
Comment 16 henrikc 2003-09-11 12:24:02 UTC
Created attachment 9196 [details]
MS Word 2000 file. Problem reproduced in OOo 1.1.0 rc8 Danish.
Comment 17 henrikc 2003-09-11 12:30:26 UTC
Created attachment 9198 [details]
Another Word2000 file with the problem. OOo 1.1.0 rc8, Danish
Comment 18 stp 2003-09-12 11:55:47 UTC
WFM in RC3 English build.

I suspect the problem is caused by the Danish RC build, and in that
case this issue should be assigned to Pavel Janik.
Comment 19 pavel 2003-09-12 12:00:21 UTC
@henrikc: Can you reproduce with original English RC4?
Comment 20 Unknown 2003-09-12 12:09:03 UTC
I get the bug on my clean Windows XP system too. It is occurring 
consistently and apparently without exceptions. Someone suggested 
a "bad build"? This is the build used:

ftp://ftp.linux.cz/pub/localization/OpenOffice.org/devel/1.1.0-RC4/build-8/OOo_11rc4_danish_Win32Intel_install.zip

OS: Windows XP SP1 Pro Danish
Comment 21 pavel 2003-09-12 12:29:51 UTC
We should separate two issues: bad cut&paste and bad import. We do not
change anything in cut&paste, but our patch which changes old DOC
import could be a potential problem. But without testing original RC4
English, I can not say more.
Comment 22 henrikc 2003-09-12 12:55:51 UTC
I've tested my files in English rc4, and the import bug does *not* 
exist there. Neither does the cut&paste problem, but that one has been a 
little more elusive, so I'll try it out some more. Personally, I believe 
the two are the same issue.


Sure looks like a build problem. Pavel?
Comment 23 jack.warchold 2003-09-12 13:08:23 UTC
So. 

I installed the version tazly linked to. With that version the error is 
reproducible. Even in the English version the error occurs once the Danish 
version has been installed. After uninstalling the Danish version and 
reinstalling the English one, the error is no longer reproducible. I think 
this is an error in the Danish build. 

Pavel, is this your issue then?
Comment 24 pavel 2003-09-12 16:26:55 UTC
Then yes, it is a problem with the build.

What is the fastest way to reproduce it?

Opening No.RTF or Humor.doc in OOo RC4?
Comment 25 henrikc 2003-09-12 16:50:00 UTC
Either of the two files would do; they're both small and show the same 
problem.
Comment 26 henrikc 2003-09-12 20:08:11 UTC
Sorry, misunderstood the question.
Take either of the last two files; those are verified to have the problem 
with rc4 build 8.
Comment 27 pavel 2003-09-12 22:33:29 UTC
Reassign to me.
Comment 28 pavel 2003-09-12 22:36:00 UTC
It is in fact caused by one of the patches we use in our version. I have
made it conditional on the Czech version only, so Danish will not be
affected from build-9 on.

Please verify in build-9, which will be produced during the weekend.

This is only about opening the file Humor.doc, not about cut&paste.
Please file a separate issue for that with a good explanation. But as I
said, we do not patch anything about cut&paste...
Comment 29 pavel 2003-09-12 22:38:08 UTC
Please verify.
Comment 30 pavel 2003-09-12 22:42:19 UTC
-
Comment 31 pavel 2003-09-12 22:43:24 UTC
-
Comment 32 henrikc 2003-09-12 22:44:04 UTC
Will verify when build 9 is there, on Monday at latest.
Comment 33 henrikc 2003-09-12 22:45:26 UTC
Midair collisions abound. Did the status end up correct?
Comment 34 pavel 2003-09-13 09:21:01 UTC
Fixed build-9 for GNU/Linux is on its way to ftp.linux.cz.
Comment 35 henrikc 2003-09-13 09:29:59 UTC
Good.
Since I found the problem under Windows, a Windows build would be nice 
as well, or I'll not feel safe that the issue is resolved.
Comment 36 pavel 2003-09-13 09:32:56 UTC
Of course :-) Windows build will be a little bit later, approx. on
Tuesday.

Other people can at least confirm on Linux. But the final word is on
you anyway :-)
Comment 37 stp 2003-09-13 11:41:36 UTC
- added myself cc:

Henrik, please reassign this issue to Pavel.
Comment 38 pavel 2003-09-13 11:43:03 UTC
No, it is fixed from my side. I reassigned it back to Henrik for
verification.
Comment 39 stp 2003-09-13 11:50:11 UTC
OK, I will wait for you to build a Windows version, since the Windows
version caused the problem in the first place. Is the problem
confirmed in a Danish Linux RC4 build?
Comment 40 pavel 2003-09-13 12:37:08 UTC
Yes, it is a generic problem. I reproduced it with build-8 on
GNU/Linux as well. build-9 is OK.
Comment 41 jack.warchold 2003-09-15 12:28:24 UTC
set target to 1.1.1
Comment 42 henrikc 2003-09-16 19:58:32 UTC
> set target to 1.1.1

   Say what? It's a critical MS-Word import bug. I'm just waiting for 
the Windows build to declare it fixed. Pavel?
Comment 43 pavel 2003-09-16 21:05:50 UTC
This problem is only present in our builds and is already solved in
Linux builds. Windows builds will follow next week because Josef
is on vacation. The fix will be included in 1.1.0 Danish final.
Comment 44 henrikc 2003-09-23 23:04:02 UTC
I've verified the problem is fixed in RC5. Setting new status.
Interestingly, that seems to be the case for cut&paste too - had no 
problems during a few operations. A power outage prevented me from 
testing this intensely, but I'll make noise if it turns out there's 
still trouble.
Now I'm wondering what that patch of our Czech friends was about - 
will it still be applied to their version, leading to them having 
trouble importing the Scandinavian chars?  
Comment 45 pavel 2003-09-25 09:20:04 UTC
We have problems with Czech characters without that patch. With it,
we are OK. This is a tradeoff: having Czech or not... It will be fixed
in 2.0 for all languages.

Thank you for verifying.
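
One possible explanation for why ä and ö survive while æ, ø and å turn into '?' is sketched below. It is an assumption, not something confirmed in this thread. Windows-1250, the Central European code page that covers Czech, contains ä and ö but has no slots for æ, ø or å. The thread does not say which code page, if any, the Czech patch forces on import. `cp1250` is chosen here purely for illustration, and `through_codepage` is a hypothetical helper name.

```python
# Assumption: the text is squeezed through a legacy 8-bit code page on import.
# Nothing in the issue confirms that the patch works this way.
SAMPLE = "æøå ÆØÅ äö"


def through_codepage(text, codepage="cp1250"):
    """Encode to a legacy code page and decode again.

    Characters the code page cannot represent are replaced by '?'.
    """
    return text.encode(codepage, errors="replace").decode(codepage)


print(through_codepage(SAMPLE))            # Central European: ??? ??? äö
print(through_codepage(SAMPLE, "cp1252"))  # Western European: æøå ÆØÅ äö
```

Under that assumption the trade-off Pavel describes makes sense: a Central European code page keeps the Czech letters intact but cannot represent the Danish and Norwegian ones, while the Swedish and Finnish ä and ö happen to exist in both.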