Issue 18675 - when saving a Hebrew file to HTML, punctuation is incorrectly marked as English
Summary: when saving a Hebrew file to HTML, punctuation is incorrectly marked as English
Status: ACCEPTED
Alias: None
Product: Internationalization
Classification: Code
Component: BiDi
Version: OOo 1.1 RC3
Hardware: PC Windows XP
Importance: P3 Trivial with 1 vote
Target Milestone: ---
Assignee: AOO issues mailing list
QA Contact:
URL:
Keywords:
Depends on: 16354
Blocks:
Reported: 2003-08-26 09:08 UTC by sforbes
Modified: 2013-08-07 15:00 UTC

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
original write file (5.36 KB, application/octet-stream)
2003-08-26 09:09 UTC, sforbes
exported html (1.18 KB, text/html)
2003-08-26 09:10 UTC, sforbes
test file created directly in the html editor (879 bytes, text/html)
2003-08-26 09:10 UTC, sforbes
simple test file created with OOo 680m26 (5.20 KB, application/vnd.sun.xml.writer)
2004-02-23 11:06 UTC, sforbes
HTML created by OOo 680m26 - notice that the punctuation is in an English span (671 bytes, text/html)
2004-02-23 11:07 UTC, sforbes

Description sforbes 2003-08-26 09:08:51 UTC
OOo 1.1 RC3, running on Windows XP, Hebrew locale. HTML export encoding set to windows-1255.

I created a simple document using styles and no ad-hoc formatting.
Saved it as HTML.

Looking at the HTML code, it seems that all punctuation is enclosed within <SPAN
LANG="en-US">, which is incorrect. The punctuation is not in English, and marking
it as English could lead to incorrect browser rendering (the browser may choose a
different font for the punctuation).

I got the same result with the HTML editor.
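
For anyone wanting to check an exported file for this pattern, here is a minimal sketch in Python. It scans HTML for SPAN elements with LANG="en-US" whose text is nothing but punctuation and whitespace. The class and function names are hypothetical, and the sample markup at the end is made up to resemble the description above; it is not copied from the attachments.

```python
from html.parser import HTMLParser
import unicodedata


class PunctuationSpanFinder(HTMLParser):
    """Collects the text of <span lang="en-US"> elements that hold only punctuation."""

    def __init__(self):
        super().__init__()
        self._langs = []      # lang attribute of each currently open <span>
        self._buffers = []    # text collected for each currently open <span>
        self.suspicious = []  # punctuation-only text found inside en-US spans

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so "SPAN"/"LANG" match here.
        if tag == "span":
            self._langs.append(dict(attrs).get("lang"))
            self._buffers.append([])

    def handle_data(self, data):
        # Text is credited to the innermost open span only.
        if self._buffers:
            self._buffers[-1].append(data)

    def handle_endtag(self, tag):
        if tag == "span" and self._langs:
            lang = self._langs.pop()
            text = "".join(self._buffers.pop()).strip()
            only_punct = bool(text) and all(
                unicodedata.category(ch).startswith("P") or ch.isspace()
                for ch in text
            )
            if lang == "en-US" and only_punct:
                self.suspicious.append(text)


def find_punctuation_spans(html):
    """Return the stripped text of every en-US span that contains only punctuation."""
    finder = PunctuationSpanFinder()
    finder.feed(html)
    finder.close()
    return finder.suspicious


if __name__ == "__main__":
    # Hypothetical sample: two Hebrew words (written as escapes) with wrapped punctuation.
    sample = (
        '<P LANG="he-IL" DIR="RTL">\u05e9\u05dc\u05d5\u05dd'
        '<SPAN LANG="en-US">, </SPAN>\u05e2\u05d5\u05dc\u05dd'
        '<SPAN LANG="en-US">!</SPAN></P>'
    )
    print(find_punctuation_spans(sample))
```

A span that contains real English text is not reported, so the check only flags the case described in this issue.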
Comment 1 sforbes 2003-08-26 09:09:28 UTC
Created attachment 8764 [details]
original write file
Comment 2 sforbes 2003-08-26 09:10:01 UTC
Created attachment 8765 [details]
exported html
Comment 3 sforbes 2003-08-26 09:10:57 UTC
Created attachment 8766 [details]
test file created directly in the html editor
Comment 4 mehlng 2003-08-26 09:32:20 UTC
This is the correct behaviour for now, since for some reason all
punctuation is currently considered Western.
Once that is fixed, this not-really-a-problem will be fixed as well.

*** This issue has been marked as a duplicate of 16354 ***
Comment 5 sforbes 2003-08-26 09:50:07 UTC
They are related but not duplicates. Even if issue 16354 is fixed,
there is no guarantee that this will be fixed as well.
Comment 6 mehlng 2003-08-26 10:03:49 UTC
Well, I won't argue, but I think that changing all punctuation to
Hebrew fonts will definitely solve the problem.
Anyhow, see for yourself: if you think there's anything to fix, let's
leave it open.
I hope that the punctuation issue will be solved quickly.
Comment 7 Dieter.Loeschky 2003-08-27 11:38:46 UTC
DL->MIB: Would you please take over?
Comment 8 michael.brauer 2003-08-27 11:55:24 UTC
The HTML export uses the same code as the UI to determine the font
and language for all characters, so if 16354 is fixed, this bug will
be fixed as well.

*** This issue has been marked as a duplicate of 16354 ***
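
To illustrate the dependency described in Comment 8, here is a rough Python sketch of a shared classifier in which punctuation has no script of its own and inherits the class of the preceding strong character. This is an assumption about the desired behaviour, not OOo's actual code; the names script_class and language_runs and the class labels are hypothetical.

```python
import unicodedata


def script_class(ch):
    """Classify one character as 'complex' (RTL letters), 'latin', or 'weak'."""
    bidi = unicodedata.bidirectional(ch)
    if bidi in ("R", "AL"):   # Hebrew and Arabic letters
        return "complex"
    if bidi == "L":           # left-to-right letters
        return "latin"
    return "weak"             # punctuation, spaces, digits, and so on


def language_runs(text, default="complex"):
    """Split text into (class, substring) runs.

    Weak characters inherit the class of the last strong character seen;
    leading weak characters take the paragraph default (an assumption).
    """
    runs = []
    current = default
    for ch in text:
        cls = script_class(ch)
        if cls != "weak":
            current = cls
        if runs and runs[-1][0] == current:
            runs[-1][1] += ch
        else:
            runs.append([current, ch])
    return [(cls, chunk) for cls, chunk in runs]
```

Under this scheme a purely Hebrew sentence with a comma and an exclamation mark comes out as a single run, so an exporter built on it would have no reason to emit a separate English span for the punctuation.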
Comment 9 sforbes 2004-02-23 11:05:22 UTC
Tested with 680m26, which according to the release notes
( http://development.openoffice.org/releases/680_m26_snapshot.html ) has the fix
for issue #16354, of which this bug was marked a duplicate.

This problem still occurs and does not seem to be fixed by the fix for issue #16354.

See the new sample files.
Comment 10 sforbes 2004-02-23 11:06:40 UTC
Created attachment 13355 [details]
simple test file created with OOo 680m26
Comment 11 sforbes 2004-02-23 11:07:25 UTC
Created attachment 13356 [details]
HTML created by OOo 680m26 - notice that the punctuation is in an English span
Comment 12 michael.brauer 2004-06-04 16:21:04 UTC
To my surprise, #i16354# did not solve the issue.
Comment 13 sforbes 2004-06-21 14:44:36 UTC
Adding dina as CC.