Issue 21593 - Bad html to e-mail export
Summary: Bad html to e-mail export
Status: CLOSED IRREPRODUCIBLE
Alias: None
Product: Writer
Classification: Application
Component: ui
Version: OOo 1.1
Hardware: Other Windows ME
Importance: P3 Trivial
Target Milestone: ---
Assignee: eric.savary
QA Contact: issues@sw
URL:
Keywords: needmoreinfo, oooqa
Depends on:
Blocks:
 
Reported: 2003-10-23 01:06 UTC by waltercz
Modified: 2013-08-07 14:38 UTC

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
Source HTML file (595 bytes, text/html)
2003-12-28 20:00 UTC, waltercz
O.O setting (24.35 KB, image/gif)
2003-12-28 20:01 UTC, waltercz
Generated e-mail (1.44 KB, text/html)
2003-12-28 20:04 UTC, waltercz
original Hebrew HTML (547 bytes, text/html)
2003-12-29 16:41 UTC, sforbes
emailed Hebrew HTML (624 bytes, text/html)
2003-12-29 16:41 UTC, sforbes

Description waltercz 2003-10-23 01:06:34 UTC
Whatever charset is selected under HTML compatibility (e.g. WIN-1250/WIN Latin 
2), "send as e-mail" always generates the same HTML head declaring the 8859-2 
charset, while the text itself is encoded in the selected charset. 

The generated head is as follows:

<META http-equiv=Content-Type content="text/html; charset=iso-8859-2">

You may reply in Czech, English or German.

Walter
Comment 1 stx123 2003-11-07 16:27:50 UTC
moving to Word Processor
Comment 2 stx123 2003-11-07 16:28:30 UTC
Reassign issue to owner of selected subcomponent
Comment 3 Rainer Bielefeld 2003-12-01 06:33:00 UTC
Hi Waltercz,

I believe I have seen similar problems when creating websites, but I was
not able to pin the problem down exactly (insufficient knowledge of
charsets), so I need your help. Can you please attach some
(very!) short documents to show some details?

Helpful would be:
- source document (.sxc), maybe with only 1 sentence
- document transferred to html with some short explanations concerning 
  the charset
  - as shown in the header (visible for everyone)
  - as actually used (with some hints where in the text the actual charset
    can be proved, like "'ê' only exists in iso - xxx")
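
One way to find such hints is to encode a sample text in the charset actually
used and decode it again with the charset named in the header; every character
that comes out differently is a character that proves the mismatch. The
following is only a minimal sketch in Python. The function name
mismatched_chars and the sample sentence (a common Czech pangram) are
illustrative assumptions, not anything produced by OOo.

```python
def mismatched_chars(text, actual="cp1250", declared="iso-8859-2"):
    """Return the characters of `text` that change when bytes written in
    `actual` are read back as `declared` (both single-byte charsets)."""
    raw = text.encode(actual)                       # bytes as really written
    shown = raw.decode(declared, errors="replace")  # what a reader trusting the header sees
    # Both charsets use one byte per character, so positions line up 1:1.
    return sorted({orig for orig, seen in zip(text, shown) if orig != seen})


if __name__ == "__main__":
    sample = "Příšerně žluťoučký kůň pěl ďábelské ódy."
    print(mismatched_chars(sample))
```

For this sample only š, ť and ž differ between Windows-1250 and ISO-8859-2, so
a test document should contain at least one of them. Text containing characters
that the `actual` charset cannot encode will raise an error in this sketch.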

Do you see the same problems in CALC - documents saved as HTML ?

Do you agree that something like "charset indicated in html document
header not in accordance with actual charset in the document" might be a
more accurate summary for this issue?

Thanks

Rainer
Comment 4 Rainer Bielefeld 2003-12-01 07:12:07 UTC
Issues that might be related to this one:
issue 4453, issue 9128, issue 10373, issue 17037, issue 19514

Rainer
Comment 5 Rainer Bielefeld 2003-12-26 08:52:02 UTC
We will discuss this in [qa-dev] 

Rainer
Comment 6 settantta 2003-12-26 09:15:04 UTC
In OO.o 680 I have the charset set to utf-8 (Tools-->Options-->Load/Save-->HTML Compatibility). Using 
View-->HTML-Source, the headers show the correct charset (<META HTTP-EQUIV="CONTENT-TYPE" 
CONTENT="text/html; charset=utf-8">). I have not yet tested in 1.1.0...
Comment 7 settantta 2003-12-26 09:20:02 UTC
Just repeated the same steps in 1.1.0, with the same results, the header still shows utf-8 as 
charset... This is all under Linux, perhaps the problem is only under Windows?
Comment 8 askoorb 2003-12-26 11:08:16 UTC
I have Win98 and live in the UK, on OOo 1.1.0. My default character set for HTML 
Compatibility is Western Europe (Windows-1252/WinLatin 1), and I have no 
problems with bad coding on this charset, or with Unicode UTF-8.
Comment 9 askoorb 2003-12-26 15:52:51 UTC
When the default charset is changed between UTF-8, ISO8859/1 and 
Windows-1252/WinLatin 1, the tags in the exported html refer to these. I have no 
problems with bad tags on my setup.
Comment 10 Rainer Bielefeld 2003-12-27 17:36:32 UTC
I was not able to reproduce the problem either.

Reporter, please attach very detailed step-by-step instructions and also the
source and e-mail documents, and any other information that might be useful for
further clarification.

If I do not see any further action such as votes, attachments or confirmations in
this issue, I will have to close it on 2004-01-15 as WFM.

CU

Rainer
Comment 11 settantta 2003-12-27 23:48:16 UTC
Rainer et al... I suspect the reporter may be Czech (since he uses "cz" as part of his username 
and refers to the Czech language).

Could this bug be specific to either locale settings, or to a specific localization? I would be very 
interested to know whether the reporter is using a Czech localization or not.

Perhaps this could be referred to the relevant project? I'll send a precis to Pavel Janik and ask if 
he can reproduce with the cs localization.
Comment 12 pavel 2003-12-28 10:18:43 UTC
waltercz: which version do you use? Do you use 1.1.0cz or original English version?

Could you please try exporting as HTML only, instead of e-mail? Your e-mail program
may be converting the HTML page...

I can not reproduce it with original English version of 1.1.0 nor with Czech
versions. The resulting HTML page is always encoded as specified in Tools /
Options / Loading/Saving / HTML compatibility / Character set.
Comment 13 waltercz 2003-12-28 20:00:42 UTC
Created attachment 12147
Source HTML file
Comment 14 waltercz 2003-12-28 20:01:33 UTC
Created attachment 12148
O.O setting
Comment 15 waltercz 2003-12-28 20:04:55 UTC
Created attachment 12149
Generated e-mail
Comment 16 waltercz 2003-12-28 20:06:14 UTC
Dear Rainer,

Sorry for the late reply; I was very busy.

Please find the following attachments:

Source - source html file
Output - an e-mail output
Setting - O.O. setting (gif file)

The e-mail source is as follows (charset=iso-8859-2, but the text is encoded in
Windows-1250):

-----------
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD><TITLE></TITLE>
<META http-equiv=Content-Type content="text/html; charset=iso-8859-2">
<META content="MSHTML 5.50.4134.100" name=GENERATOR>
<META content="Walter Müller" name=AUTHOR>
<META content=20031228;17434215 name=CREATED>
<META content="Walter Müller" name=CHANGEDBY>
<META content=20031228;18093215 name=CHANGED>
<STYLE></STYLE>
</HEAD>
<BODY lang=cs-CZ dir=ltr bgColor=#ffffff>
<DIV>&nbsp;</DIV>
<DIV style="FONT: 10pt arial">----- Original Message ----- 
<DIV style="BACKGROUND: #e4e4e4; font-color: black"><B>From:</B> <A
title=muller@waj.cz href="mailto:muller@waj.cz">Walter Müller</A> </DIV>
<DIV><B>Sent:</B> Sunday, December 28, 2003 6:10 PM</DIV>
<DIV><B>Subject:</B> Output</DIV></DIV>
<DIV><BR></DIV>
<DIV>&nbsp;</DIV>
<P>PříšernÄ› žluÅ¥ouÄký kůň pÄ›l Äábelské ódy.</P>
<P><BR><BR></P>
<P><BR><BR></P></BODY></HTML>
-------------------


(System : Windows ME, OO 1.1.0)

Regards

Walter Müller
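
A mismatch like the one described above (header says iso-8859-2, bytes are
Windows-1250) can sometimes be spotted mechanically, because Windows-1250
places š, ť and ž at byte values that ISO-8859-2 treats as C1 control
characters. The following Python sketch is only a heuristic under that
assumption; the names declared_charset and has_c1_controls and the sample page
are hypothetical, and a header naming a charset Python does not know would
raise a LookupError here.

```python
import re

# First "charset=..." token in the raw bytes, quoted or unquoted.
META_CHARSET = re.compile(rb"charset=[\"']?([A-Za-z0-9_.:-]+)", re.IGNORECASE)


def declared_charset(html_bytes):
    """Return the charset named in the first declaration found, or None."""
    match = META_CHARSET.search(html_bytes)
    return match.group(1).decode("ascii").lower() if match else None


def has_c1_controls(html_bytes):
    """Heuristic: True if decoding with the declared charset yields
    C1 control characters (U+0080..U+009F), which real text rarely contains."""
    charset = declared_charset(html_bytes)
    if charset is None:
        return False
    text = html_bytes.decode(charset, errors="replace")
    return any("\x80" <= ch <= "\x9f" for ch in text)


if __name__ == "__main__":
    page = ('<META http-equiv=Content-Type content="text/html; '
            'charset=iso-8859-2">\n<P>žluťoučký kůň</P>')
    print(has_c1_controls(page.encode("cp1250")))      # header and bytes disagree
    print(has_c1_controls(page.encode("iso-8859-2")))  # header and bytes agree
```

The check cannot prove that a file is correct: a Windows-1250 text without š,
ť or ž decodes identically under both charsets.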
Comment 17 Joost Andrae 2003-12-28 20:15:14 UTC
JA->ES: reassigned to you. please have a look at it...
Comment 18 pavel 2003-12-28 20:19:19 UTC
es: the Czech version uses one specific patch that could make a difference here.

See
ftp://ftp.linux.cz/pub/localization/OpenOffice.org/devel/build/Patches/OOo_1.1.1_source-do-not-export-chars-as-html-entities.diff

waltercz: could you please test original English version too?
Comment 19 Rainer Bielefeld 2003-12-29 07:40:13 UTC
I checked with 1.1.0 German version WIN98SE: 645m19(Build8693):

0. Start OOo WRITER
1. change settings in accordance with 'Setting.gif' (I only changed charset 
   to 'windows-1250')
2. open new html-document
3. open attachment 'source.html'
4. Menu: 'File - send document - as email'
   a new email document opened in MOZILLA 1.6b
5. Save email as draft
6. goto email folder "drafts"
7. look for saved email
8. rightclick on attached html-file, 'save'
9. open saved html-file in OOo (from webpage- UI)
10. change to view "html-source" for just opened file
11. goto OOo- window with source file
12. change to 'source view', too
13. compare source
    expected: same in both windows
    actual: changed lines:
    	<META NAME="CHANGEDBY" CONTENT="Rainer Bielefeld">
        <META NAME="CHANGED" CONTENT="20031229;8133763">
        <P>PÅ™&iacute;&scaron;ernÄ› žluÅ¥ouÄk&yacute; kůň pÄ›l &aacute;belsk&eacute;
            &oacute;dy.</P>	
These changes are not as extreme as in the attached 'Output e-mail.eml' (and
who knows what MOZILLA, OE and 'eml-export' will do with the files), but it
seems that a discussion of these results might be useful.
My OOo always masks (escapes) some characters in the body when I save 'source.html'.
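
When comparing two sources as in step 13, it can help to separate lines that
differ only because characters were written as HTML entities (&iacute; instead
of í) from lines that really changed. A minimal Python sketch, assuming both
sources are already available as decoded strings; the function name
changed_lines and the sample strings are hypothetical.

```python
import difflib
import html


def changed_lines(original, emailed):
    """Return lines of `emailed` that differ from `original` even after
    HTML entities such as &iacute; have been turned back into characters."""
    before = [html.unescape(line) for line in original.splitlines()]
    after = [html.unescape(line) for line in emailed.splitlines()]
    diff = difflib.ndiff(before, after)
    # ndiff prefixes lines that exist only in the second input with "+ ".
    return [line[2:] for line in diff if line.startswith("+ ")]


if __name__ == "__main__":
    original = '<META NAME="CHANGEDBY" CONTENT="A">\n<P>Příšerně</P>'
    emailed = '<META NAME="CHANGEDBY" CONTENT="B">\n<P>Př&iacute;&scaron;erně</P>'
    for line in changed_lines(original, emailed):
        print(line)
```

In this sample only the CHANGEDBY line is reported; the paragraph is treated as
unchanged because the entities decode to the same characters.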

Reporter, can you please
contribute an HTML e-mail attachment created according to my steps 5 ... 8?

I 

Rainer


 
Comment 20 sforbes 2003-12-29 16:39:33 UTC
Tested this with Mozilla 1.5 and Hebrew OpenOffice 1.1 on Linux, with a simple
Hebrew file (one line).
Locale is set to Hebrew, HTML charset set to windows-1255.

Compared both files in a text editor (katoob): the email file had an extra line
at the top, with the charset declaration repeated (outside the HEAD element,
which is wrong).
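
A repeated declaration like this can be found without reading the source by
eye. The Python sketch below lists every Content-Type META declaration and
whether it sits inside HEAD. It is only an illustration: the class and function
names are hypothetical, and the sample document is made up to resemble the
described file, not copied from the attachment.

```python
from html.parser import HTMLParser


class MetaCharsetAudit(HTMLParser):
    """Collect (charset, inside_head) for each Content-Type META tag."""

    def __init__(self):
        super().__init__()
        self.in_head = False
        self.found = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "head":
            self.in_head = True
        elif tag == "meta":
            a = {name: (value or "") for name, value in attrs}
            if a.get("http-equiv", "").lower() == "content-type":
                content = a.get("content", "").lower()
                charset = content.partition("charset=")[2].strip()
                self.found.append((charset, self.in_head))

    def handle_endtag(self, tag):
        if tag == "head":
            self.in_head = False


def audit_meta(html_text):
    parser = MetaCharsetAudit()
    parser.feed(html_text)
    return parser.found


if __name__ == "__main__":
    sample = (
        '<META HTTP-EQUIV="CONTENT-TYPE" CONTENT="text/html; charset=windows-1255">\n'
        "<HTML><HEAD>\n"
        '<META HTTP-EQUIV="CONTENT-TYPE" CONTENT="text/html; charset=windows-1255">\n'
        "</HEAD><BODY><P>x</P></BODY></HTML>"
    )
    for charset, inside_head in audit_meta(sample):
        print(charset, "inside HEAD" if inside_head else "outside HEAD")
```

More than one entry, or any entry marked as outside HEAD, points to the kind of
duplicated declaration described above.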

In OOo Writer:
When viewing the source, for some reason the Hebrew in the original version
displayed incorrectly as high-ANSI characters, while the version in the email
displays properly.

Very strange.
Comment 21 sforbes 2003-12-29 16:41:02 UTC
Created attachment 12170
original Hebrew HTML
Comment 22 sforbes 2003-12-29 16:41:32 UTC
Created attachment 12171
emailed Hebrew HTML
Comment 23 Rainer Bielefeld 2003-12-29 17:00:12 UTC
Hi Shoshannah,

I saw your funny email file, but I can not reproduce the effect with
1.1.0 German version WIN98SE: 645m19(Build8693)   and 
Mozilla/5.0 (Windows; U; Win98; de-AT; rv:1.6b) Gecko/20031208

My exported email file from 1.html looks quite normal.

But it seems that there _are_ several difficulties, even if they are not really
reproducible. So: NEW


Rainer
Comment 24 askoorb 2003-12-29 18:19:59 UTC
I can not reproduce the problem on Win98 OOo1.1 English version.

Also, though I think it is correct, is the subcomponent right above?
Comment 25 eric.savary 2004-05-06 16:59:44 UTC
Can someone reproduce anything anywhere now in a current version?
If yes, please detail if the problem can be reproduced with different mailers.
I also think it's a framework problem (IF it is an OOo problem).

Otherwise I'll close as WFM.
Comment 26 waltercz 2004-06-10 20:49:54 UTC
Hi es! 

I have the Czech version 1.1.1 and the problem remains. If there is a 
newer version (e.g. a beta), please send me a download link and I will check it.

Regards

Walter
Comment 27 eric.savary 2004-12-17 02:44:48 UTC
Current version is 1.1.4.
Test version is src680m66.
Please have a look at it and if you have the same problem, reassign to 'tra'
Comment 28 eric.savary 2004-12-17 02:45:11 UTC
closed