Bug 16992

Summary: Unrecognised character "euro"
Product: POI Reporter: gr_nico
Component: HSSFAssignee: POI Developers List <dev>
Status: RESOLVED INVALID    
Severity: normal    
Priority: P3    
Version: unspecified   
Target Milestone: ---   
Hardware: PC   
OS: All   
Attachments: An excel containing only an "euro" character

Description gr_nico 2003-02-12 13:30:39 UTC
I am using jakarta-poi-1.9.0-dev-20030203.jar

When I try to set to a cell a string containing the character "euro" : '€', 
an '?' is displayed instead.
Comment 1 gr_nico 2003-02-12 13:41:15 UTC
Created attachment 4827 [details]
An excel containing only an "euro" character
Comment 2 Andy Oliver 2003-02-12 13:45:38 UTC
ask on the list.
Comment 3 Carey Sublette 2003-02-14 19:45:07 UTC
The likely cause for this is an attempt to use the Windows-1252 (Codepage-1252) euro character 
(0x80) instead of the the Unicode euro (0x20AC). Windows-1252 is an extension of ISO-8859-1 (Latin-
1) that added 27 symbols not found in Latin-1 (but found in the full Unicode character set) 

The 
org.apache.poi.util.StringUtil class specifies an encoding ("ISO-8859-1") that does not 
recognize the 27 extension symbols, and maps them on to "?". It would if it specified the encoding 
"Cp1252" instead.

I have a tested fix for this that works with all standard Unicode as well as 
the Cp1252 extensions, but haven't submitted it to Poi yet.

Carey Sublette
Comment 4 Andy Oliver 2003-07-24 15:41:10 UTC
never any more info, I think Cary is right