Bug 35849

Summary: HSSF hangs whilst reading file
Product: POI
Reporter: Dave S-B <dave>
Component: HSSF
Assignee: POI Developers List <dev>
Status: RESOLVED DUPLICATE    
Severity: major    
Priority: P3    
Version: 3.0-dev   
Target Milestone: ---   
Hardware: PC   
OS: Windows XP   
Attachments: Test Excel file

Description Dave S-B 2005-07-25 14:52:53 UTC
When reading the attached Excel file (also available at
http://www.lledr-solutions.co.uk/downloads/test.xls) HSSF hangs. There is no
error, the application just gets stuck in a loop somewhere (this is tested using
EFHSSF). The file is quite small and only contains numeric/string fields and can
be opened and viewed in MS Excel and OpenOffice. I can't track down the problem
with this file (other files are working fine).
Comment 1 Dave S-B 2005-07-25 14:56:01 UTC
Created attachment 15764
Test Excel file
Comment 2 Amol Deshmukh 2005-07-25 18:23:52 UTC
Apparently, this is a string table related issue.
The sheet has several columns containing mostly the same data (~2000 rows and 4
such columns). With the current hack in SSTDeserializer.addToStringTable(..) we
create a new string for each repeated value (actually several new strings, since
we iteratively add spaces until the value is accepted).
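
To illustrate why this gets so slow, here is a minimal sketch, assuming the hack
behaves as described above (keep appending a space until the entry is accepted).
This is not the actual SSTDeserializer code: the class and method names are
hypothetical, and a plain HashMap stands in for the real string table.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the padding behaviour described above;
// not taken from the POI source.
class SstPaddingSketch {

    // Adds value to the table, appending spaces until the key is unused.
    // Returns how many extra (padded) strings had to be created.
    static int addWithPadding(Map<String, Integer> table, String value) {
        int created = 0;
        String key = value;
        while (table.containsKey(key)) {
            key = key + " "; // every retry allocates a new, longer string
            created++;
        }
        table.put(key, table.size());
        return created;
    }

    // Inserts the same value `repeats` times and totals the extra strings.
    static long totalExtraStrings(int repeats) {
        Map<String, Integer> table = new HashMap<>();
        long total = 0;
        for (int i = 0; i < repeats; i++) {
            total += addWithPadding(table, "same");
        }
        return total;
    }

    public static void main(String[] args) {
        // One column of ~2000 identical cells, as in the attached sheet.
        System.out.println(totalExtraStrings(2000));
    }
}
```

In this sketch the k-th repeat of a value needs k - 1 padded strings, so n
repeats cost n(n-1)/2 extra strings in total (1999000 for n = 2000), and each
one is longer than the last. That quadratic growth would be consistent with the
apparent hang.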

This is a duplicate of 33681 (at least the cause is the same). 

Found this discussion about the fix:
http://www.mail-archive.com/poi-user@jakarta.apache.org/msg05915.html

If there is a patch already, I can test it.

*** This bug has been marked as a duplicate of 33681 ***
Comment 3 Amol Deshmukh 2005-07-25 18:40:45 UTC
http://www.mail-archive.com/poi-user@jakarta.apache.org/msg05915.html is
probably a fix for a different issue. So it looks like we never had a fix for
the multiple string creation.
Comment 4 Avik Sengupta 2005-07-25 19:32:01 UTC
(In reply to comment #3)
> http://www.mail-archive.com/poi-user@jakarta.apache.org/msg05915.html is
> probably a fix for a different issue. So it looks like we never had a fix for
> the multiple string creation.

Yeah, that's a different one.