Summary: | Unrecognised character "euro" | ||
---|---|---|---|
Product: | POI | Reporter: | gr_nico |
Component: | HSSF | Assignee: | POI Developers List <dev> |
Status: | RESOLVED INVALID | ||
Severity: | normal | ||
Priority: | P3 | ||
Version: | unspecified | ||
Target Milestone: | --- | ||
Hardware: | PC | ||
OS: | All | ||
Attachments: | An excel containing only an "euro" character |
Description
gr_nico
2003-02-12 13:30:39 UTC
Created attachment 4827 [details]
An excel containing only an "euro" character
ask on the list. The likely cause for this is an attempt to use the Windows-1252 (Codepage-1252) euro character (0x80) instead of the the Unicode euro (0x20AC). Windows-1252 is an extension of ISO-8859-1 (Latin- 1) that added 27 symbols not found in Latin-1 (but found in the full Unicode character set) The org.apache.poi.util.StringUtil class specifies an encoding ("ISO-8859-1") that does not recognize the 27 extension symbols, and maps them on to "?". It would if it specified the encoding "Cp1252" instead. I have a tested fix for this that works with all standard Unicode as well as the Cp1252 extensions, but haven't submitted it to Poi yet. Carey Sublette never any more info, I think Cary is right |