When reading the attached excel file (also available at http://www.lledr-solutions.co.uk/downloads/test.xls) HSSF hangs. There is no error, the application just gets stuck in a loop somewhere (this is tested using EFHSSF). The file is quite small and only contains numeric/string fields and can be opened and viewed in MS Excel and OpenOffice. I can't track down the problem with this file (other files are working fine).
Created attachment 15764 [details] Test Excel file
Apparently, a string table related issue. The sheet has several columns containing mostly same data (~2000 rows and 4 such columns). With the current hack in SSTDeserializer.addToStringTable(..) we create new string for each repeated value (actually several new strings since we iteratively add spaces). This is a duplicate of 33681 (at least the cause is the same). Found this discussion about the fix: http://www.mail-archive.com/poi-user@jakarta.apache.org/msg05915.html If there is a patch already, I can test it. *** This bug has been marked as a duplicate of 33681 ***
http://www.mail-archive.com/poi-user@jakarta.apache.org/msg05915.html is probably fix for a different issue. So looks like we never had a fix for the multiple string creation.
(In reply to comment #3) > http://www.mail-archive.com/poi-user@jakarta.apache.org/msg05915.html is > probably fix for a different issue. So looks like we never had a fix for the > multiple string creation. Yeah, that's a different one.