Issue 110486

Summary: Calc/Writer: data loss on import table from .html file
Product: Calc Reporter: bormant <bormant>
Component: uiAssignee: AOO issues mailing list <issues>
Status: CONFIRMED --- QA Contact:
Severity: Trivial    
Priority: P2 CC: helenrussian, issues
Version: OOO320m14   
Target Milestone: ---   
Hardware: All   
OS: All   
Issue Type: DEFECT Latest Confirmation in: ---
Developer Difficulty: ---
Attachments:
Description Flags
test file none

Description bormant 2010-03-30 10:09:41 UTC
Experiment 1:
1) start OOo
2) File - Open - test.html, HTML Document (OpenOffice.org Calc)
3) document opens in calc (be patient), 
4) navigate to F column, press Ctrl+Down arrow

As for me, column F breaks at row 10923, column A breaks at row 10924 (error).
Columns B and C are complete.

Experiment 2:
1) start OOo
2) File - Open - test.html, All files
3) document opens in writer/web (be patient),
3) navigate to F:10667 in table

As for me, table breaks in column F at row 10667, other data follow the table 
(error).
Comment 1 bormant 2010-03-30 10:10:53 UTC
Created attachment 68632 [details]
test file
Comment 2 bormant 2010-03-30 10:28:20 UTC
Experiment 3:
1) open test.html in browser
2) select and copy whole table
3) open Calc
4) paste (Ctrl+V)

As for me, column F breaks at row 10923, column A breaks at row 10924 (error).
Columns B and C are complete.

If Paste special (Ctrl+Shift+V) - Unformatted text on step 4 -- complete table 
goes in document (without formatting, of course).
Comment 3 helenrussian 2010-03-30 11:32:21 UTC
I confirm experiments 1 and 2 with OOo 3.2 on Linux as described.

In experiment 3 my browser hangs. :(
Comment 4 gquigs 2010-05-21 07:01:55 UTC
Another example file available on launchpad bug:
https://bugs.launchpad.net/ubuntu/+source/openoffice.org/+bug/480130

In this file it cuts off at 6555. This bug appears in Ubuntu's OOo, Go-oo on
Windows and OpenOffice 3.2.1rc.  

Tested all files with both Excel and Firefox, nothing is wrong with the html in
this bug report or launchpads.
Comment 5 gquigs 2010-05-21 07:40:19 UTC
You can create a new document that exhibits this behavior by using Excel:
File up columns 1-4 with data down to row 18,000. Can just be 1-17,999.
Save it as a Web Page from excel.

Open in OpenOffice.  Scroll down to the end, you will notice that instead of
being a table with multiple columns, there is only one column at the bottom
(it's cut off, but the data looks like it has been merged into it. 

Bug still exists in DEV300m77 on Windows.
Comment 6 Olaf Felka 2010-05-21 07:50:19 UTC
This should be better reviewed by the Spreadsheet folks.