Apache OpenOffice (AOO) Bugzilla – Full Text Issue Listing |
Summary: | html export: charset utf-8 uses named entities | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Product: | Writer | Reporter: | Regina Henschel <rb.henschel> | ||||||||
Component: | ui | Assignee: | AOO issues mailing list <issues> | ||||||||
Status: | ACCEPTED --- | QA Contact: | |||||||||
Severity: | Trivial | ||||||||||
Priority: | P5 (lowest) | CC: | che, issues, mey.wer, pavel, pmladek, xslf | ||||||||
Version: | OOo 1.1 RC2 | Keywords: | oooqa | ||||||||
Target Milestone: | --- | ||||||||||
Hardware: | PC | ||||||||||
OS: | All | ||||||||||
Issue Type: | ENHANCEMENT | Latest Confirmation in: | --- | ||||||||
Developer Difficulty: | --- | ||||||||||
Attachments: |
|
Description
Regina Henschel
2003-09-13 02:02:04 UTC
Please Attach the documents which make this problem, so we can test it/faster to confirm. (Without the documents, we cannot confirm the problem easily/need more time) Don't forget to cut other part of the documents, so the file size is small, but we still able to see the problem. Created attachment 9949 [details]
example produced HTML
The attached file was produced by 'new HTML-DOcument'. The behavior is indepent of the field 'Export' in the 'HTML Compatibility'-dialog. 'Character Set' in that dialog was set to 'UTF-8'. Created attachment 9950 [details]
how the document should be
The document testspecialcharacter_correct shows the correct coding of umlaut 'ü' in UTF-8. confirming. But since this doesn't produce broken HTML, I set this to Prio5 (changed subject, OS to ALL) original summary: html export: information in help doesn't fit to behavior Reassigned to ES ES->MIB: Please evaluate To offer as much compatibility as possible, the HTML export in fact uses (named) entities for as much characters as possible. One can consider this to be a bug or a feature ... Created attachment 15618 [details]
This patch fixes the problem for ISO8859-1 and MS 1250
A similar patch which fixes the problem for ISO 8859-7 and MS 1253 is mentioned in the Issue #28241. *** Issue 53483 has been marked as a duplicate of this issue. *** Problem occures still in OOo2.0 And there is yet another problem - maybe should be a new bug/issue: "To offer as much compatibility as possible, the HTML export in fact uses (named) entities for as much characters as possible. One can consider this to be a bug or a feature ..." That's not true at all! OO replaces „ “ and other by " (99 down) and " (66 up)! I don't like (I hate), that OOo replaces code, which was created manually before. |