Apache OpenOffice (AOO) Bugzilla – Issue 34698
Importing a WORD document OOo inserts "linefeeds" in lists
Last modified: 2004-10-08 12:35:41 UTC
When importing (some) bulleted Word documents, additional lines and bullets are added, and others are removed. A document saved in Word97 format that shows this behaviour is available. (Used OOo 1.1.3RC3.)
Created attachment 17965 [details] Import behaviour: outline and bulleting changes;line addition
Also occurred in previous versions.
I checked with 1.1.2RC3 (German) WIN XP: [645m44(Build8784)] and can confirm that OOo inserts "linefeeds" and also bullets for an empty text line (after the text "Kosten kunnen zonder ..."). In the attached screenshot you can see that OOo very often inserts "linefeeds", as in the paragraphs marked red (same number for same paragraph). But sometimes OOo imports the document correctly, as in the paragraphs marked green. It would be interesting to know a way to reproduce that with WORD. I can't believe that this is a general problem; we would have had bug reports. For me the question is what special circumstances in the example document cause the problem. Reporter, can you contribute more information? As long as we have not found out that this is a general WORD import problem, P4 should be enough. Rainer
Created attachment 17980 [details] screenshots comparing text result in WORD2000 and OOO
Oops, in "incorrect_paragraphs.sxd" both "No1" examples should be marked red for "wrong import", only "11" is green for 'correct'. Rainer
I checked with 2.0 (1.9.m51) English version WIN XP: [680m51(Build8801)] and saw the same problem. Rainer
I made some tests with WORD2000 and created some small texts with lists; all of them were imported without problems. Must be something special with "OORefDoc.doc"? Rainer
Rainer, I'm not sure whether the document contains non-standard items. It may well be related to bulleting (e.g. level changes); I remember open issues about bullet import. I don't know if it's worth a P3. I did observe a failing import. And a resulting document (LFs removed, bulleting cleaned up, and saved from OOo in doc format) wasn't shown correctly in a receiving user's Word installation. I guess it's not only an import but also an export issue, related to something odd in this doc.
When opening OORefDoc.doc with changes in the document shown (Edit - Changes - Show in OO.o; I am not sure about the Word menus, because I am using the Croatian version), you can see that the additional lines shown in incorrect_paragraphs.sxd had been there, but were deleted by one of the reviewers. So I think the real question is why the text was correctly deleted and the empty paragraphs were not.
Created attachment 17992 [details] File showing empty paragraphs deleted with change tracking turned on in MS Word. Empty paragraphs are not deleted in OOo, only the third paragraph with text (the letter "c").
sakitc, *that* sounds interesting! Rainer
What I did before I sent the document: the original document was larger. I deleted (in Word) some 3 pages, but the behaviour remained the same. I'm happy to attach the original, although I guess it will not add new information: the behaviour is already illustrated.
Created attachment 17993 [details] More examples of (in)correctly importing deleted new lines in revised files
In the above attachment (b.doc) you can see that only deleted new lines contained within deleted text are deleted when you import revised MS Word documents into OO.o.
To be more precise: when I said "revised text" in my last comment, I meant "text edited with change tracking turned on". Anyway, you should be able to see for yourself, if you compare b.doc when opened in MS Word and when imported in OO.o. If this issue affects all documents in which tracked changes haven't been accepted/rejected, perhaps it would make sense to raise priority a bit?
Created attachment 18052 [details] b.doc reproduced in OO.o
I created another attachment: b-native.sxw, in which I repeated everything I did when creating b.doc. The difference in the <tracked changes>...</tracked changes> part of the content.xml files of b.doc and b-native.sxw can be seen in the following "printouts" of the relevant sections of content.xml for the imported (from MS Word) and native (OO.o) files:

b.doc

    <text:tracked-changes>
      <text:changed-region text:id="ct62595960">
        <text:deletion>
          <office:change-info office:chg-author="tomislav.cepulic" office:chg-date-time="2004-09-29T13:49:00"/>
          <text:p text:style-name="Standard">deleted</text:p>
        </text:deletion>
      </text:changed-region>
      <text:changed-region text:id="ct62825808">
        <text:deletion>
          <office:change-info office:chg-author="tomislav.cepulic" office:chg-date-time="2004-09-29T13:49:00"/>
          <text:p text:style-name="Standard">deleted</text:p>
          <text:p text:style-name="Standard">deleted </text:p>
        </text:deletion>
      </text:changed-region>
    </text:tracked-changes>

b-native.sxw

    <text:tracked-changes>
      <text:changed-region text:id="ct172361400">
        <text:deletion>
          <office:change-info office:chg-author="Tomislav Cepulic" office:chg-date-time="2004-10-01T15:27:00"/>
          <text:p text:style-name="Standard"/>
          <text:p text:style-name="Standard"/>
        </text:deletion>
      </text:changed-region>
      <text:changed-region text:id="ct172198304">
        <text:deletion>
          <office:change-info office:chg-author="Tomislav Cepulic" office:chg-date-time="2004-10-01T15:27:00"/>
          <text:p text:style-name="Standard"/>
          <text:p text:style-name="Standard"/>
        </text:deletion>
      </text:changed-region>
      <text:changed-region text:id="ct172308792">
        <text:deletion>
          <office:change-info office:chg-author="Tomislav Cepulic" office:chg-date-time="2004-10-01T15:27:00"/>
          <text:p text:style-name="Standard">deleted</text:p>
          <text:p text:style-name="Standard"/>
        </text:deletion>
      </text:changed-region>
      <text:changed-region text:id="ct172296272">
        <text:deletion>
          <office:change-info office:chg-author="Tomislav Cepulic" office:chg-date-time="2004-10-01T15:27:00"/>
          <text:p text:style-name="Standard">deleted</text:p>
          <text:p text:style-name="Standard">deleted </text:p>
        </text:deletion>
      </text:changed-region>
    </text:tracked-changes>

You can see that b-native.sxw (or its content.xml) didn't lose any of the four changes, and b.doc lost all the empty paragraphs (the <text:p text:style-name="Standard"/> parts in b-native.sxw's content.xml).
Of course, when I said "content.xml of b.doc" in my last post, I meant "content.xml of the OpenOffice.org file saved after importing b.doc into OpenOffice.org."
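For anyone who wants to repeat this comparison on other files, here is a minimal Python sketch that counts empty and non-empty paragraphs in each tracked deletion. It is only an illustration, not part of any OOo tooling: the function names are made up, the sample strings are shortened from the printouts above (the office:change-info elements are left out), and the namespace URI is an assumption. The code matches elements by local name only, so it does not depend on that URI being right.

```python
import xml.etree.ElementTree as ET
import zipfile

# Assumed namespace declaration; the code below ignores the URI itself.
NS = 'xmlns:text="http://openoffice.org/2000/text"'

# Shortened from the content.xml saved after importing b.doc.
IMPORTED = f"""<text:tracked-changes {NS}>
  <text:changed-region text:id="ct62595960"><text:deletion>
    <text:p text:style-name="Standard">deleted</text:p>
  </text:deletion></text:changed-region>
  <text:changed-region text:id="ct62825808"><text:deletion>
    <text:p text:style-name="Standard">deleted</text:p>
    <text:p text:style-name="Standard">deleted </text:p>
  </text:deletion></text:changed-region>
</text:tracked-changes>"""

# Shortened from the content.xml of b-native.sxw.
NATIVE = f"""<text:tracked-changes {NS}>
  <text:changed-region text:id="ct172361400"><text:deletion>
    <text:p text:style-name="Standard"/><text:p text:style-name="Standard"/>
  </text:deletion></text:changed-region>
  <text:changed-region text:id="ct172198304"><text:deletion>
    <text:p text:style-name="Standard"/><text:p text:style-name="Standard"/>
  </text:deletion></text:changed-region>
  <text:changed-region text:id="ct172308792"><text:deletion>
    <text:p text:style-name="Standard">deleted</text:p>
    <text:p text:style-name="Standard"/>
  </text:deletion></text:changed-region>
  <text:changed-region text:id="ct172296272"><text:deletion>
    <text:p text:style-name="Standard">deleted</text:p>
    <text:p text:style-name="Standard">deleted </text:p>
  </text:deletion></text:changed-region>
</text:tracked-changes>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def summarize_deletions(xml_text):
    """Return one (non_empty, empty) paragraph count per tracked deletion."""
    root = ET.fromstring(xml_text)
    summary = []
    for region in root.iter():
        if local_name(region.tag) != "changed-region":
            continue
        for deletion in region:
            if local_name(deletion.tag) != "deletion":
                continue
            non_empty = empty = 0
            for para in deletion:
                if local_name(para.tag) != "p":
                    continue
                if "".join(para.itertext()).strip():
                    non_empty += 1
                else:
                    empty += 1
            summary.append((non_empty, empty))
    return summary


def read_content_xml(path):
    """Read content.xml from an .sxw file, which is a zip archive."""
    with zipfile.ZipFile(path) as archive:
        return archive.read("content.xml").decode("utf-8")
```

Calling summarize_deletions(IMPORTED) and summarize_deletions(NATIVE) shows the difference directly: the imported file has two deletions with no empty paragraphs, while the native file has four deletions, three of which contain empty paragraphs. For real files, summarize_deletions(read_content_xml("b-native.sxw")) should work the same way, assuming the file is in the working directory.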
Known issue. The problem here is that OO's concept of numbering/bullets in tracked changes is a bit different from MS Word's. Duplicate of issue 18507. *** This issue has been marked as a duplicate of 18507 ***
Closing duplicate.