Issue 75653

Summary: HTML import doesn't recognize UCS-2 encoding.
Product: Calc Reporter: pagalmes.lists
Component: open-importAssignee: AOO issues mailing list <issues>
Status: ACCEPTED --- QA Contact:
Severity: Trivial    
Priority: P3 CC: issues, jbf.faure, kpalagin, rb.henschel
Version: OOo 2.1   
Target Milestone: ---   
Hardware: All   
OS: All   
Issue Type: DEFECT Latest Confirmation in: ---
Developer Difficulty: ---
Attachments:
Description Flags
under Excel
none
html file none

Description pagalmes.lists 2007-03-22 15:01:19 UTC
- See how things look under Excel.
- To open the file under OOo, see issue 70981

"Start OpenOffice.org and then File - Open. Choose the document
and then choose the file typ "HTML Document (OpenOffice.org Calc)
(*.html;*.htm)". You find this in the section, which begins with "OpenOffice.org
Spreadsheet"."

=> the chart is not displayed \and the data not well imported
Comment 1 pagalmes.lists 2007-03-22 15:01:52 UTC
Created attachment 43869 [details]
under Excel
Comment 2 pagalmes.lists 2007-03-22 15:02:31 UTC
Created attachment 43870 [details]
html file
Comment 3 pagalmes.lists 2007-03-22 15:19:27 UTC
I just wanted to add that this issue will impact many big companies.
This HTML-XLS format is used by many critical software for companies 
like "wmware", "Cognos"...

One of the company being impacted represents more than 10000 PC.
Comment 4 kpalagin 2007-03-25 20:56:10 UTC
Confirming with 2.2m11 - Calc would not correctly open "CPU.xls" with steps 
"Start OpenOffice.org and then File - Open. Choose the document
and then choose the file typ "HTML Document (OpenOffice.org Calc)
(*.html;*.htm)". 
Comment 5 kpalagin 2007-03-25 20:57:57 UTC
*** Issue 75654 has been marked as a duplicate of this issue. ***
Comment 6 frank 2007-03-30 14:07:40 UTC
Hi,

the Chart can't be displayed because it is not a Graphic but will be drawn by
Excel if it loads this file. If you try to open this doc with a browser the
isn't displayed too.

From my point of view this file isn't a valid HTML file as it starts with two
bytes containing non printable characters. Therefore this is IMHO an invalid
Issue. Maybe Eike can shed some light on this. So I let him decide what to do
with this Issue.

Frank
Comment 7 ooo 2007-04-12 18:05:47 UTC
The file is in UCS-2LE (16-bit Unicode Little Endian) text encoding, which isn't
recognized by the HTML filter. This may be fixed, which I accept this issue for.

However, even if the HTML part would be imported, all the extra functionality
would not work because it is accomplished using private <x:...> elements and
x:... attributes specific to Excel.
Comment 8 frank 2007-04-25 10:58:53 UTC
*** Issue 75654 has been marked as a duplicate of this issue. ***
Comment 9 pagalmes.lists 2007-10-10 15:54:25 UTC
Isn't issue 70981 a duplicate ?

BTW, Cognos ReportNet also uses this HTML-XLS format to create charts...
Comment 10 ooo 2007-10-12 19:00:47 UTC
No, issue 70981 is about the CSS formatting not imported. This issue here is
about not even the HTML being imported because the UCS-2 text encoding isn't
recognized.
Comment 11 ingvarha 2008-02-26 14:18:50 UTC
Encoding aside, a fix for that should not be too hard.

Is it possible to make calc understand that a html file contains excel (or other
microsoft-special) formatting <x:foo>, warn the user, and by request parse that
formatting? Or are such parts unparsable by ooo?

Ingvar
Comment 12 jbf.faure 2010-04-02 18:07:58 UTC
Add me to cc.
Comment 13 Marcus 2017-05-20 11:13:36 UTC
Reset assigne to the default "issues@openoffice.apache.org".