Issue 78095 - Pasting a surrogate-pair character from MS Office Word into OOo produces an additive device.
Alias: None
Product: Writer
Classification: Application
Component: code
Version: OOo 2.2
Hardware: PC Windows XP
Importance: P3 Trivial
Target Milestone: ---
Assignee: AOO issues mailing list
QA Contact:
Depends on:
Reported: 2007-06-05 03:01 UTC by kangjch
Modified: 2017-05-20 11:17 UTC
CC List: 7 users

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---

the fixed codes (879 bytes, text/plain)
2007-06-28 03:08 UTC, kangjch
no flags

Description kangjch 2007-06-05 03:01:48 UTC
Preconditions: Microsoft Office 2003, OOo 2.2.0
Steps to reproduce:
       1. Create a document in Microsoft Office Word 2003.
       2. Input 0x20000 and press Alt+X.
       3. Select and copy (Ctrl+C) this character.
       4. Create a new document in OOo Writer and paste (Ctrl+V).
       5. An additive device is displayed.

I also tested many other surrogate pairs (code points in the range
0x20000-0x22000, which belong to CJK Extension B).
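For reference, a minimal sketch of how a code point in this range maps to a UTF-16 surrogate pair, following the standard UTF-16 encoding rule. The function names are hypothetical and are not taken from the OOo sources.

```python
from typing import Tuple


def to_surrogate_pair(code_point: int) -> Tuple[int, int]:
    """Split a supplementary-plane code point into UTF-16 high/low surrogates."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("only code points above U+FFFF need a surrogate pair")
    offset = code_point - 0x10000      # 20-bit value
    high = 0xD800 + (offset >> 10)     # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits
    return high, low


def from_surrogate_pair(high: int, low: int) -> int:
    """Recombine a high/low surrogate pair into a single code point."""
    if not (0xD800 <= high <= 0xDBFF and 0xDC00 <= low <= 0xDFFF):
        raise ValueError("not a valid surrogate pair")
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


# U+20000, the character used in the steps above
print([hex(unit) for unit in to_surrogate_pair(0x20000)])
```

If either half of such a pair is handled on its own during import, the result can no longer be displayed as the original character.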
Comment 1 michael.ruess 2007-06-05 08:42:27 UTC
Reassigned to ES.
Comment 2 kangjch 2007-06-05 09:18:14 UTC
Correction to step 3: Select All (Ctrl+A) and copy (Ctrl+C) this character.
Comment 3 kangjch 2007-06-08 08:15:01 UTC
According to Stephan Bergmann, I'd like to investigate this.
Comment 4 kangjch 2007-06-21 10:12:24 UTC
I have now retested this in SRC680_m215.
Step 4 (create a new document in OOo Writer and paste with Ctrl+V) still
produces wrong characters.

Besides pasting directly, I also pasted using the following special methods:
 1. "Microsoft office word documents": a frame is displayed; double-clicking it
opens MS Word, which shows the right character.
 2. "Html Format": a frame with a yellow background is displayed, with a '?' in it.
 3. "Html Format without format": only a '?' is shown.
*4. "Formated Text(RTF)": the right character is shown.
*5. "Unformated Text": the right character is shown.
*6. "GDI metafile": a frame is also displayed, with the right character in it.

Notes: "*n" marks the right (wanted) cases.

Comment 5 Stephan Bergmann 2007-06-22 12:21:11 UTC
@kangjch:  So, case 1 ("Microsoft office word documents") also works correctly,
right?  That only leaves cases 2 and 3 (HTML) as broken (plus the default copy
action, which appears to be one of 2 or 3; I asked os which of the cases is
chosen as the default, but he could not tell offhand).  For clipboard import of
HTML into Writer, code in sw/source/filter/html is used, so you should look there.
Comment 6 liujiaxiang 2007-06-27 09:22:04 UTC
Comment 7 liujiaxiang 2007-06-27 09:22:43 UTC
Comment 8 kangjch 2007-06-28 03:08:48 UTC
Created attachment 46326 [details]
the fixed codes
Comment 9 kangjch 2007-06-28 03:10:05 UTC
I have attached the code changes as patch.diff; please check them.
Comment 10 Stephan Bergmann 2007-06-28 07:40:41 UTC
kangjch, re the attached patch.diff:  As I already wrote at
<>:  "That's
cheating, I would say:  The user requests to insert HTML, but you 
change that to RTF instead.  (Plus, it does not work if there is no RTF 
available for the clipboard data.)  I guess the real fix would be to the 
ReadHTML function instead..."
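To illustrate what a fix on the HTML side could involve, here is a hedged sketch. It is not the actual ReadHTML code, and it assumes (this thread does not confirm it) that the clipboard HTML carries the character as a numeric character reference and that the importer stores text as 16-bit code units. All function names are hypothetical.

```python
import re
from typing import List

# Matches one decimal or hexadecimal HTML numeric character reference.
CHAR_REF = re.compile(r"&#(?:[xX]([0-9A-Fa-f]+)|([0-9]+));")


def ref_to_code_point(ref: str) -> int:
    """Parse a reference such as '&#x20000;' or '&#131072;' into a code point."""
    match = CHAR_REF.fullmatch(ref)
    if match is None:
        raise ValueError("not a numeric character reference: %r" % ref)
    if match.group(1) is not None:
        return int(match.group(1), 16)
    return int(match.group(2))


def decode_16bit_only(ref: str) -> List[int]:
    """Assumed failure mode: anything above U+FFFF is replaced by '?'."""
    code_point = ref_to_code_point(ref)
    return [code_point] if code_point <= 0xFFFF else [ord("?")]


def decode_with_surrogates(ref: str) -> List[int]:
    """Emit a UTF-16 surrogate pair for code points above U+FFFF."""
    code_point = ref_to_code_point(ref)
    if code_point <= 0xFFFF:
        return [code_point]
    offset = code_point - 0x10000
    return [0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)]


print([hex(unit) for unit in decode_16bit_only("&#x20000;")])
print([hex(unit) for unit in decode_with_surrogates("&#x20000;")])
```

Under these assumptions the first decoder would match the '?' seen in the HTML paste cases, while the second keeps the character without switching the clipboard format to RTF.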
Comment 11 Mathias_Bauer 2007-12-04 09:08:36 UTC
Stephan, please take over.
Comment 12 Mathias_Bauer 2007-12-04 12:36:25 UTC
following release status meeting -> target 3.x
Comment 13 Stephan Bergmann 2008-04-16 13:25:50 UTC
Comment 14 Marcus 2017-05-20 11:17:56 UTC
Reset assignee to the default "".