Bug 42074 - Unable to read Double Byte Characters
Summary: Unable to read Double Byte Characters
Alias: None
Product: POI
Classification: Unclassified
Component: HSSF (show other bugs)
Version: 2.5-FINAL
Hardware: PC Windows XP
: P2 regression (vote)
Target Milestone: ---
Assignee: POI Developers List
Depends on:
Reported: 2007-04-09 21:08 UTC by Stanley Stein
Modified: 2008-01-09 05:49 UTC (History)
0 users


Note You need to log in before you can comment on or make changes to this bug.
Description Stanley Stein 2007-04-09 21:08:45 UTC
I have an Excel files consisting of some Traditional Chinese characters and
normal alphabets. When I use HSSFCell::getStringCellValue() to read some Chinese
characters, it returns question marks.

e.g. The cell contains "How are you 你好嗎"
The returned value will be "How are you ???"

I also try to add HSSFCell::setEncoding(HSSFCell.ENCODING_UTF_16) before reading
the cell, but the problem still occurs.

Please kindly advise. Thanks a lot.
Comment 1 Nick Burch 2008-01-09 05:49:10 UTC
Should work fine with POI 3.0.1 and later

Please re-open with a failing test case if your problem is still there with a
svn checkout of poi