Issue 75653 - HTML import doesn't recognize UCS-2 encoding.
Summary: HTML import doesn't recognize UCS-2 encoding.
Alias: None
Product: Calc
Classification: Application
Component: open-import (show other issues)
Version: OOo 2.1
Hardware: All All
: P3 Trivial (vote)
Target Milestone: ---
Assignee: AOO issues mailing list
QA Contact:
: 75654 (view as issue list)
Depends on:
Reported: 2007-03-22 15:01 UTC by pagalmes.lists
Modified: 2017-05-20 11:13 UTC (History)
4 users (show)

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---

under Excel (46.52 KB, image/png)
2007-03-22 15:01 UTC, pagalmes.lists
no flags Details
html file (33.41 KB, application/
2007-03-22 15:02 UTC, pagalmes.lists
no flags Details

Note You need to log in before you can comment on or make changes to this issue.
Description pagalmes.lists 2007-03-22 15:01:19 UTC
- See how things look under Excel.
- To open the file under OOo, see issue 70981

"Start and then File - Open. Choose the document
and then choose the file typ "HTML Document ( Calc)
(*.html;*.htm)". You find this in the section, which begins with "

=> the chart is not displayed \and the data not well imported
Comment 1 pagalmes.lists 2007-03-22 15:01:52 UTC
Created attachment 43869 [details]
under Excel
Comment 2 pagalmes.lists 2007-03-22 15:02:31 UTC
Created attachment 43870 [details]
html file
Comment 3 pagalmes.lists 2007-03-22 15:19:27 UTC
I just wanted to add that this issue will impact many big companies.
This HTML-XLS format is used by many critical software for companies 
like "wmware", "Cognos"...

One of the company being impacted represents more than 10000 PC.
Comment 4 kpalagin 2007-03-25 20:56:10 UTC
Confirming with 2.2m11 - Calc would not correctly open "CPU.xls" with steps 
"Start and then File - Open. Choose the document
and then choose the file typ "HTML Document ( Calc)
Comment 5 kpalagin 2007-03-25 20:57:57 UTC
*** Issue 75654 has been marked as a duplicate of this issue. ***
Comment 6 frank 2007-03-30 14:07:40 UTC

the Chart can't be displayed because it is not a Graphic but will be drawn by
Excel if it loads this file. If you try to open this doc with a browser the
isn't displayed too.

From my point of view this file isn't a valid HTML file as it starts with two
bytes containing non printable characters. Therefore this is IMHO an invalid
Issue. Maybe Eike can shed some light on this. So I let him decide what to do
with this Issue.

Comment 7 ooo 2007-04-12 18:05:47 UTC
The file is in UCS-2LE (16-bit Unicode Little Endian) text encoding, which isn't
recognized by the HTML filter. This may be fixed, which I accept this issue for.

However, even if the HTML part would be imported, all the extra functionality
would not work because it is accomplished using private <x:...> elements and
x:... attributes specific to Excel.
Comment 8 frank 2007-04-25 10:58:53 UTC
*** Issue 75654 has been marked as a duplicate of this issue. ***
Comment 9 pagalmes.lists 2007-10-10 15:54:25 UTC
Isn't issue 70981 a duplicate ?

BTW, Cognos ReportNet also uses this HTML-XLS format to create charts...
Comment 10 ooo 2007-10-12 19:00:47 UTC
No, issue 70981 is about the CSS formatting not imported. This issue here is
about not even the HTML being imported because the UCS-2 text encoding isn't
Comment 11 ingvarha 2008-02-26 14:18:50 UTC
Encoding aside, a fix for that should not be too hard.

Is it possible to make calc understand that a html file contains excel (or other
microsoft-special) formatting <x:foo>, warn the user, and by request parse that
formatting? Or are such parts unparsable by ooo?

Comment 12 jbf.faure 2010-04-02 18:07:58 UTC
Add me to cc.
Comment 13 Marcus 2017-05-20 11:13:36 UTC
Reset assigne to the default "".