Issue 79121 - Extended Unicode characters create corrupted ODS file.
Summary: Extended Unicode characters create corrupted ODS file.
Status: CLOSED DUPLICATE of issue 66693
Alias: None
Product: Calc
Classification: Application
Component: save-export (show other issues)
Version: OOo 2.2.1
Hardware: PC All
: P3 Trivial (vote)
Target Milestone: ---
Assignee: niklas.nebel
QA Contact: issues@sc
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2007-07-03 00:28 UTC by jallan3
Modified: 2013-08-07 15:14 UTC (History)
1 user (show)

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
Characters U+10000 to 10004 in Excel format (63.50 KB, application/vnd.ms-excel)
2007-08-11 03:12 UTC, jallan3
no flags Details
U+10000 to U+100004 Calc ods file which will not open (9.02 KB, application/vnd.oasis.opendocument.spreadsheet)
2007-08-11 03:13 UTC, jallan3
no flags Details
U+10000 to 100F9 Excel file (107.00 KB, application/vnd.ms-excel)
2007-08-11 03:15 UTC, jallan3
no flags Details
U+10000 to 100F9 Calc ods file which does open successfully (16.08 KB, application/vnd.oasis.opendocument.spreadsheet)
2007-08-11 03:17 UTC, jallan3
no flags Details
All Linear-B printable characters in Excel format (107.50 KB, application/vnd.ms-excel)
2007-08-11 03:18 UTC, jallan3
no flags Details
All Linear-B characters in Calc ods file which will not open (16.19 KB, application/vnd.oasis.opendocument.spreadsheet)
2007-08-11 03:20 UTC, jallan3
no flags Details

Note You need to log in before you can comment on or make changes to this issue.
Description jallan3 2007-07-03 00:28:25 UTC
Create a Calc file with some extended (Plane 1) Unicode characters. You must
include a number of different characters. Try the forty twenty extended Unicode
characters in the free font http://users.teilar.gr/~g1951d for example and use
characters 10000 to 100036 in different cells.

The file saves properly as an ODS file, but when I attempt to read it in again,
I get the message:

Read error:
Format error discovered in the file at 2,11247125(row. col) 

The number varies.

If I save the file as an Excel file (Microsoft Excel 97/2000/XP (.xls)) I have
no problems in reading it back in. I also have no problems with reading an .ods
file containing any number of Plane 0 Unicode characters.

Accordingly I believe the problem has to do with Unicode surrogate values not
being properly saved.

Jallan3
Comment 1 mbayer 2007-07-03 00:40:50 UTC
jallan3: Can you please attach an example file to this issue (maybe in both
formats)?
Comment 2 frank 2007-08-07 14:22:47 UTC
Hi,

tried to reproduce but was not able to get this problem. Due to this and the
inactivity of the submitter I close this Issue as worksforme. Feel free to
re-open it if you attach a file showing the problem, even the broken ODS file
will do it.

Frank
Comment 3 frank 2007-08-07 14:23:06 UTC
closed wfm
Comment 4 jallan3 2007-08-11 03:07:48 UTC
I attempted further tests, seeing what happened on an Windows XP system as
opposed to a Vista system (and limiting my testing only to the Linear-B block,
that is U+10000 to U+100FA).

Under XP I do not find any problems in opening and reading .ods files, other
than that character U+10000 has always vanished when the Calc file is opened,
though it is retained in an .xls file. But all files open.

Trying again in my Vista machine, I find that U+10000 similarly always vanishes
when a file is saved as .ods, but not when the file is saved as .xls. Also, I
find that in Vista, files saved as .ods are only *usually* not readable, but
have been unable to determine what causes this.

I attach three .xls files and three corresponding .ods files created from the
.xls files, two of which will not open and one which will open (but with U+10000
missing).

Jim Allan



Comment 5 jallan3 2007-08-11 03:12:12 UTC
Created attachment 47476 [details]
Characters U+10000 to 10004 in Excel format
Comment 6 jallan3 2007-08-11 03:14:00 UTC
Created attachment 47477 [details]
U+10000 to U+100004 Calc ods file which will not open
Comment 7 jallan3 2007-08-11 03:15:35 UTC
Created attachment 47478 [details]
U+10000 to 100F9 Excel file
Comment 8 jallan3 2007-08-11 03:17:46 UTC
Created attachment 47479 [details]
U+10000 to 100F9 Calc ods file which does open successfully
Comment 9 jallan3 2007-08-11 03:18:56 UTC
Created attachment 47480 [details]
All Linear-B printable characters in Excel format
Comment 10 jallan3 2007-08-11 03:20:10 UTC
Created attachment 47481 [details]
All Linear-B characters in Calc ods file which will not open
Comment 11 frank 2007-08-23 12:25:14 UTC
Hi Niklas,

please have a look at this one.

Frank
Comment 12 niklas.nebel 2007-08-24 17:48:24 UTC
duplicate

*** This issue has been marked as a duplicate of 66693 ***
Comment 13 niklas.nebel 2007-08-24 17:49:04 UTC
closing duplicate