Issue 5276 - HTML preview mode cannot handle large amounts of accents and punctuation
Summary: HTML preview mode cannot handle large amounts of accents and punctuation
Status: CLOSED DUPLICATE of issue 17171
Alias: None
Product: Writer
Classification: Application
Component: code (show other issues)
Version: OOo 1.0.0
Hardware: PC Windows ME
: P3 Trivial (vote)
Target Milestone: ---
Assignee: michael.brauer
QA Contact: issues@sw
URL:
Keywords: oooqa
: 23897 (view as issue list)
Depends on:
Blocks:
 
Reported: 2002-05-25 18:53 UTC by Unknown
Modified: 2013-08-07 14:38 UTC (History)
2 users (show)

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
Cut down version of the HTML file in the user provided link. (26.88 KB, application/octet-stream)
2002-10-20 16:18 UTC, prgmgr
no flags Details

Note You need to log in before you can comment on or make changes to this issue.
Description Unknown 2002-05-25 18:53:15 UTC
Trying to load and edit a large (1M) all-text HTML document, I noticed that HTML
editor and word processor doesn't show all of the text but only about 8% of it.
The HTML editor shows the complete source but saves only the displayed part when
I try to export it.

But, oddly, it takes a small <pre> text that's at the end of the document and
displays it at the end of the part of the document it's displaying.
I even tried to remove the <pre> section, same result: the text abruptly ends at
21 out of 318 rows (as measured in NoteTab)
Comment 1 Unknown 2002-05-25 18:55:47 UTC
You can find the original document at
http://abu.cnam.fr/cgi-bin/donner?nddp1

Comment 2 prgmgr 2002-09-06 04:48:17 UTC
Federico, thanks for posting.

I tried downloading the web page at the URL you provided, but it was 
only about 27Kb.

Do I have the right web page?
Comment 3 Unknown 2002-09-10 14:43:34 UTC
They changed something, now it's at

http://abu.cnam.fr/cgi-bin/donner_html?nddp1
 
Federico Spano`
Comment 4 prgmgr 2002-09-20 02:48:53 UTC
Just want to add some notes to this issue.

It is not the size of the file that is causing the problem.  My guess 
is that there is a hidden character that is causing OOo to only 
display part of the HTML even though it has loaded the full source.

The HTML in the original link is broken.  I've run it through W3 Tidy 
but the problem still exists.

I'm trying to isolate the part of the file that is causing the issue.
Comment 5 prgmgr 2002-10-20 16:15:47 UTC
Duplicated on Win NT 4.0 SP6a, OOo 643.

User summary:
HTML editor can't show and export large documents

Summary:

User originally reported that the document at the link was loaded by 
the HTML editor, but in the HTML preview mode only part of the HTML 
page was displayed.

If you switch to the HTML source mode, the entire document was 
present.

The problem is the number of accents and punctuation marks in the 
HTML file.  Once I start stripping out some of the accents and 
punctuation marks, I start seeing more of the HTML page in the HTML 
preview mode.

Will attach a stripped down test case.
Comment 6 prgmgr 2002-10-20 16:18:11 UTC
Created attachment 3243 [details]
Cut down version of the HTML file in the user provided link.
Comment 7 prgmgr 2002-10-20 16:19:14 UTC
In the attached HTMl file, try removing punctuation marks or the HTML 
accents.  You'll see more and more of the HTML doc in the HTML 
preview mode.

Note, I ran the HTML file at the user link through Tidy before I 
started to trim it down.
Comment 8 eric.savary 2003-01-30 19:24:10 UTC
to me
Comment 9 eric.savary 2003-02-11 11:22:51 UTC
ES->MIB: in source mode, the text ends with ", il tomba en 
poussière.". In WYSIWYG mode, ends with "cardinal tout décontenancé 
et" which indeed represents only 8% of the text.
Comment 10 michael.brauer 2003-02-25 08:20:07 UTC
The issue is that the document contains a single paragraph, and that
the paragarph size is limited to 65535 characters on OOo.
Comment 11 michael.brauer 2003-04-09 13:50:06 UTC
Fixed in CWS sw009
Comment 12 michael.brauer 2003-04-09 13:57:04 UTC
Last comment was wrong.
Comment 13 michael.brauer 2003-04-30 06:57:07 UTC
There are two options to solve this:
1. Increasing the maximum paragraph size
2. Adding a paragraph break when the maximimum paragarph size is reached.

Since paragraphs that have more than 65535 are an exception and in
fact not very useful and since the paragraph size is limited by the
string size (that is 65535 as well), option 2. seems to be the
apropriate solution.
Comment 14 andreas.martens 2003-09-12 14:59:22 UTC
.
Comment 15 eric.savary 2003-09-30 17:01:41 UTC
According to the OpenOffice.org roadmap 
(http://tools.openoffice.org/releases) this issue was retargeted to 
OOo Later.
Comment 16 michael.brauer 2004-01-12 09:14:20 UTC
*** Issue 23897 has been marked as a duplicate of this issue. ***
Comment 17 lohmaier 2005-01-20 19:24:46 UTC
mib wrote: "The issue is that the document contains a single paragraph, and that
the paragarph size is limited to 65535 characters on OOo."

thus this is a duplicate of issue 17171

please decide which one to keep and mark the other one as duplicate. (if keeping
this one, don't forget to change the summary...)
Comment 18 eric.savary 2005-08-18 10:45:55 UTC
duplicate

*** This issue has been marked as a duplicate of 17171 ***
Comment 19 eric.savary 2005-08-18 10:46:09 UTC
closed