Issue 62683

Summary: Freeze on open Excel2003 XML file with incorrect symbol within
Product: Calc Reporter: pmike <www.openoffice.org>
Component: open-importAssignee: AOO issues mailing list <issues>
Status: ACCEPTED --- QA Contact:
Severity: Trivial    
Priority: P3 CC: ahz001, issues, knmc, lars_o_hansen
Version: OOo 2.0.2Keywords: oooqa
Target Milestone: ---   
Hardware: PC   
OS: Windows XP   
Issue Type: DEFECT Latest Confirmation in: ---
Developer Difficulty: ---
Attachments:
Description Flags
sample none

Description pmike 2006-03-01 14:48:41 UTC
Excel2003 XML file contains incorrect symbol.
OOo Calc freezes on import.
Comment 1 pmike 2006-03-01 14:49:26 UTC
Created attachment 34529 [details]
sample
Comment 2 lars 2006-03-01 20:37:48 UTC
confirmed with OOo 2.0.2 RC4 on WinXP Pro SP2
Comment 3 frank 2006-03-08 10:09:03 UTC
Hi Swante,

seems to be yours.

Frank
Comment 4 svante.schubert 2006-03-08 11:08:25 UTC
You mentioned an incorrect symbol, what did you mean, pmike?
You are completly right, it hangs up. Going to take a closer look in it for pp3.

It really seems an enconding problem, the problem is triggered by the XML parser
(Apaches Xerces implementation) we use:

Caused by: java.io.UTFDataFormatException: Invalid byte 1 of 1-byte UTF-8 sequence.
	at org.apache.xerces.impl.io.UTF8Reader.invalidByte(Unknown Source)
	at org.apache.xerces.impl.io.UTF8Reader.read(Unknown Source)
	at org.apache.xerces.impl.XMLEntityScanner.load(Unknown Source)

Other applications as Firefox handle it correctly: The test file can be loaded
and it is loadable with our Office as well, if you load this file earlier in a
browser (firefox 1.5.0.1) and save it, instead directly saving it.
Comment 5 pmike 2006-03-08 16:50:31 UTC
Well, I'm not sure about correctness of "incorrect symbol", English isn't my
native language.
You're right, the file contains invalid (for UTF8 encoder) byte sequence.

I think the perfect solution will be to load file and show warning in status
area, like MSIE did. Alternatively, drop a warning during load, or simply
ingnore invalid bytes.
Comment 6 svante.schubert 2006-07-24 18:18:31 UTC
sus->pmike: 
There will be an upgrade of the XML parser and processor from Xalan 2.6 to 2.7.
I will take a look after the integration into it.
Due to other high priority tasks, I will have to postpone this issue to a later
update release for now.

The Warning message is something I desire as well, but there was no time to work
on this now. Hopefully we find someone from the community to work on this.

Comment 7 pmike 2007-05-04 14:06:30 UTC
OOo 2.2 doesn't freeze (I suppose xalan was upgraded).
But loaded doc is completely empty.
Excel2003 opens file with data.

So its OK to close this issue as FIXED (calc doesn't freeze), but skipping
invalid symbol and opening data would be really nice to have (separate issue?).
Comment 8 Martin Hollmichel 2008-01-22 07:49:43 UTC
move target from 2.x to 3.0
Comment 9 svante.schubert 2008-05-30 14:47:06 UTC
I am sorry, I am no longer working on this format.
Changed target to Office later.
Is anybody able to take this over? And/or provide a patch for this problem?

Thanks in advance,
Svante
Comment 10 oooforum (fr) 2018-11-18 12:12:29 UTC
*** Issue 67733 has been marked as a duplicate of this issue. ***