Apache OpenOffice (AOO) Bugzilla – Issue 75653
HTML import doesn't recognize UCS-2 encoding.
Last modified: 2017-05-20 11:13:36 UTC
- See how things look under Excel.
- To open the file under OOo, see issue 70981: "Start OpenOffice.org and then File - Open. Choose the document and then choose the file type "HTML Document (OpenOffice.org Calc) (*.html;*.htm)". You find this in the section which begins with "OpenOffice.org Spreadsheet"."
=> the chart is not displayed and the data is not imported correctly
Created attachment 43869 [details] under Excel
Created attachment 43870 [details] html file
I just wanted to add that this issue will impact many big companies. This HTML-XLS format is used by a lot of critical software from companies like "VMware", "Cognos"... One of the impacted companies has more than 10000 PCs.
Confirming with 2.2m11 - Calc does not correctly open "CPU.xls" with the steps "Start OpenOffice.org and then File - Open. Choose the document and then choose the file type "HTML Document (OpenOffice.org Calc) (*.html;*.htm)".
*** Issue 75654 has been marked as a duplicate of this issue. ***
Hi, the chart can't be displayed because it is not a graphic; it is drawn by Excel when Excel loads this file. If you try to open this doc with a browser, the chart isn't displayed either. From my point of view this file isn't a valid HTML file, as it starts with two bytes containing non-printable characters. Therefore this is IMHO an invalid issue. Maybe Eike can shed some light on this, so I'll let him decide what to do with this issue. Frank
The file is in UCS-2LE (16-bit Unicode Little Endian) text encoding, which isn't recognized by the HTML filter. That can be fixed, and I accept this issue for that part. However, even if the HTML part were imported, all the extra functionality would not work because it is implemented using private <x:...> elements and x:... attributes specific to Excel.
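For illustration only, here is a minimal Python sketch of how a reader could recognize the two leading non-printable bytes as a byte order mark (BOM) and pick the text encoding from it. This is not the OpenOffice.org HTML filter code; the helper names `sniff_bom` and `decode_html` and the `windows-1252` fallback are assumptions made for the example.

```python
import codecs

# Known byte order marks and the Python codec used for the bytes after them.
# UTF-32 is deliberately left out to keep the sketch short.
_BOMS = [
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),  # bytes FF FE, as in a UCS-2LE file
    (codecs.BOM_UTF16_BE, "utf-16-be"),  # bytes FE FF
]


def sniff_bom(data, default="windows-1252"):
    """Return (encoding, bom_length) guessed from a leading BOM.

    Hypothetical helper: the fallback encoding is an assumption, not
    what any particular HTML filter actually uses.
    """
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name, len(bom)
    return default, 0


def decode_html(data):
    """Decode raw file bytes to text, skipping the BOM if one is present."""
    encoding, skip = sniff_bom(data)
    return data[skip:].decode(encoding, errors="replace")
```

With this approach a file that starts with the bytes FF FE is decoded as 16-bit little-endian text instead of being treated as single-byte text with two stray characters in front.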
Isn't issue 70981 a duplicate ? BTW, Cognos ReportNet also uses this HTML-XLS format to create charts...
No, issue 70981 is about the CSS formatting not imported. This issue here is about not even the HTML being imported because the UCS-2 text encoding isn't recognized.
Encoding aside, a fix for that should not be too hard. Is it possible to make calc understand that a html file contains excel (or other microsoft-special) formatting <x:foo>, warn the user, and by request parse that formatting? Or are such parts unparsable by ooo? Ingvar
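As a rough sketch of the detection step asked about above, the following Python snippet scans already-decoded HTML text for elements and attributes that use the x: prefix, so that a user could be warned before import. It is an assumption-laden illustration, not OpenOffice.org code: the function name `find_excel_markup` is hypothetical, and a regular expression is only an approximation of real HTML parsing.

```python
import re

# Opening tags such as <x:ExcelWorkbook>; closing tags (</x:...>) are not matched.
_X_ELEMENT = re.compile(r"<\s*x:([A-Za-z][\w.-]*)")
# Attributes such as x:num="1" inside an ordinary HTML tag.
_X_ATTRIBUTE = re.compile(r"\sx:([A-Za-z][\w.-]*)\s*=")


def find_excel_markup(html):
    """Return (element_names, attribute_names) that use the x: prefix.

    Hypothetical helper: it only reports names and does not interpret them.
    """
    elements = sorted(set(_X_ELEMENT.findall(html)))
    attributes = sorted(set(_X_ATTRIBUTE.findall(html)))
    return elements, attributes
```

A non-empty result would be the trigger for a warning; interpreting the Excel-specific markup itself is a separate and much larger task.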
Add me to cc.
Reset assignee to the default "issues@openoffice.apache.org".