Bug 42074

Summary: Unable to read Double Byte Characters
Product: POI
Reporter: Stanley Stein <stanley1610>
Component: HSSF
Assignee: POI Developers List <dev>
Status: RESOLVED WORKSFORME    
Severity: regression    
Priority: P2    
Version: 2.5-FINAL   
Target Milestone: ---   
Hardware: PC   
OS: Windows XP   

Description Stanley Stein 2007-04-09 21:08:45 UTC
I have an Excel file containing some Traditional Chinese characters and
ordinary Latin letters. When I use HSSFCell::getStringCellValue() to read the
Chinese characters, it returns question marks.

e.g. The cell contains "How are you 你好嗎"
The returned value will be "How are you ???"

I also tried calling HSSFCell::setEncoding(HSSFCell.ENCODING_UTF_16) before
reading the cell, but the problem still occurs.

Please kindly advise. Thanks a lot.
Comment 1 Nick Burch 2008-01-09 05:49:10 UTC
This should work fine with POI 3.0.1 and later.

Please re-open with a failing test case if the problem is still there with an
svn checkout of POI.
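
When preparing a failing test case, it may help to first rule out the output step. This report does not establish where the characters were lost, and Java itself substitutes '?' whenever a string is encoded to a charset that cannot represent a character, which is what happens when printing to a non-Unicode console. The sketch below uses only the JDK (no POI calls); the class name QuestionMarkCheck and its helper methods are hypothetical, and the sample string stands in for whatever getStringCellValue() returns.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class QuestionMarkCheck {

    // Lists the Unicode code points of a string, e.g. "U+4F60 U+597D".
    // A string that really contains question marks shows U+003F here.
    static String codePoints(String s) {
        StringBuilder sb = new StringBuilder();
        s.codePoints().forEach(cp -> sb.append(String.format("U+%04X ", cp)));
        return sb.toString().trim();
    }

    // Encodes a string to bytes in the given charset and decodes it back.
    // Characters the charset cannot represent come back as '?'.
    static String roundTrip(String s, Charset cs) {
        return new String(s.getBytes(cs), cs);
    }

    public static void main(String[] args) {
        // Stand-in for the cell value; escapes avoid source-encoding issues.
        String value = "How are you \u4F60\u597D\u55CE";

        // Inspect the string itself, independent of any console encoding.
        System.out.println(codePoints(value));

        // ISO-8859-1 has no Chinese characters, so they are replaced.
        System.out.println(roundTrip(value, StandardCharsets.ISO_8859_1));

        // UTF-8 can represent every character, so the round trip is lossless.
        System.out.println(roundTrip(value, StandardCharsets.UTF_8).equals(value));
    }
}
```

If codePoints() on the value read from the cell shows U+4F60 and similar, the string is intact and the loss is in how it is displayed or written out; if it shows U+003F, the characters were already replaced before the string was returned.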