Apache OpenOffice (AOO) Bugzilla – Issue 79121
Extended Unicode characters create corrupted ODS file.
Last modified: 2013-08-07 15:14:13 UTC
Create a Calc file containing some extended (Plane 1) Unicode characters. You must include a number of different characters. For example, take the extended Unicode characters in the free font at http://users.teilar.gr/~g1951d and put characters U+10000 to U+10036 in different cells. The file saves as an ODS file without any error, but when I attempt to read it back in, I get the message: Read error: Format error discovered in the file at 2,11247125(row. col) The number varies. If I save the file as an Excel file (Microsoft Excel 97/2000/XP (.xls)), I have no problem reading it back in. I also have no problem reading an .ods file containing any number of Plane 0 Unicode characters. Accordingly, I believe the problem is that Unicode surrogate values are not being saved properly. Jallan3
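For readers following the surrogate hypothesis above, here is a minimal Python sketch of how a Plane 1 code point maps to a UTF-16 surrogate pair and to a four-byte UTF-8 sequence. The helper name to_surrogates is hypothetical and only illustrates the standard arithmetic; it makes no claim about how Calc's ODS export is actually implemented.

```python
from typing import Tuple


def to_surrogates(cp: int) -> Tuple[int, int]:
    """Split a supplementary code point (U+10000..U+10FFFF) into a UTF-16 surrogate pair."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("not a supplementary code point")
    offset = cp - 0x10000
    high = 0xD800 + (offset >> 10)    # top 10 bits of the offset
    low = 0xDC00 + (offset & 0x3FF)   # bottom 10 bits of the offset
    return high, low


# Sample code points from the Linear B range discussed in this issue.
for cp in (0x10000, 0x10004, 0x100FA):
    high, low = to_surrogates(cp)
    utf8 = chr(cp).encode("utf-8")    # well-formed UTF-8 uses four bytes here
    print(f"U+{cp:05X}: UTF-16 {high:04X} {low:04X}, UTF-8 {utf8.hex(' ')}")
```

If an exporter wrote each surrogate half separately instead of the combined code point, the resulting bytes would not be valid UTF-8, which would be consistent with (but does not prove) the read error reported above.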
jallan3: Can you please attach an example file to this issue (maybe in both formats)?
Hi, I tried to reproduce this but could not trigger the problem. Because of that, and the submitter's inactivity, I am closing this issue as worksforme. Feel free to reopen it if you attach a file showing the problem; even the broken ODS file will do. Frank
closed wfm
I ran further tests, comparing a Windows XP system with a Vista system (and limiting my testing to the Linear B block, that is, U+10000 to U+100FA). Under XP I find no problems opening and reading .ods files, except that character U+10000 has always vanished when the Calc file is reopened, though it is retained in an .xls file. But all files open. Trying again on my Vista machine, I find that U+10000 likewise always vanishes when a file is saved as .ods, but not when it is saved as .xls. I also find that under Vista, files saved as .ods are usually, but not always, unreadable, and I have been unable to determine what makes the difference. I attach three .xls files and the three corresponding .ods files created from them; two of the .ods files will not open, and one opens (but with U+10000 missing). Jim Allan
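Because an .ods file is a ZIP archive whose cell data lives in content.xml, the attached files can be checked outside Calc. The following Python sketch, using only the standard library, reports where an XML parser first fails. The function names check_ods and make_ods are hypothetical, and the in-memory archives are stand-ins for the attachments, not their real contents.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def check_ods(source):
    """Return None if content.xml parses, else the parser's (line, column) of the first error."""
    with zipfile.ZipFile(source) as archive:
        data = archive.read("content.xml")
    try:
        ET.fromstring(data)
    except ET.ParseError as err:
        return err.position
    return None


def make_ods(content: bytes) -> io.BytesIO:
    """Build a tiny in-memory archive holding only content.xml (not a complete ODS package)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("content.xml", content)
    buffer.seek(0)
    return buffer


# U+10000 written as proper four-byte UTF-8.
good = make_ods("<doc>\U00010000</doc>".encode("utf-8"))
# Assumed failure mode: a lone high surrogate written as three bytes.
bad = make_ods(b"<doc>\xed\xa0\x80</doc>")

print("good:", check_ods(good))
print("bad: ", check_ods(bad))
```

To test a real file, pass its path instead, for example check_ods("broken.ods"), where the filename is a placeholder.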
Created attachment 47476 [details] Characters U+10000 to 10004 in Excel format
Created attachment 47477 [details] U+10000 to U+10004 Calc ods file which will not open
Created attachment 47478 [details] U+10000 to 100F9 Excel file
Created attachment 47479 [details] U+10000 to 100F9 Calc ods file which does open successfully
Created attachment 47480 [details] All Linear-B printable characters in Excel format
Created attachment 47481 [details] All Linear-B characters in Calc ods file which will not open
Hi Niklas, please have a look at this one. Frank
duplicate *** This issue has been marked as a duplicate of 66693 ***
closing duplicate