Bug 45542

Summary: poi-3.5-beta1-20080718.jar - content from the comments of an Office 2003 xls document is not extracted.
Product: POI Reporter: xtrim <grizolle_benedicte>
Component: POI OverallAssignee: POI Developers List <dev>
Status: RESOLVED WORKSFORME    
Severity: normal    
Priority: P2    
Version: unspecified   
Target Milestone: ---   
Hardware: PC   
OS: Windows Server 2003   
Attachments: Contains JUnit test class and documents used for testing.

Description xtrim 2008-08-04 09:04:03 UTC
Created attachment 22366 [details]
Contains JUnit test class and documents used for testing.

The text contained in the comments of an excel 2003 document is not extracted.
Find in attachments the JUnit test class and the documents used for testing.
We expected to extract the words "testdoc" and "test phrase".

Notes on the attached documents:

- the document "classic_TextInComment.xls"and "form_TextInComment.xls"contain the words "testdoc" and "test phrase" as comments.


"TestUnitPoi35Filter.java" is the JUnit class.
Comment 1 Nick Burch 2008-08-04 14:35:51 UTC
Comment extraction is supported, just turned off by default. Enable it and they are included. See the javadocs for details:

http://poi.apache.org/apidocs/org/apache/poi/hssf/extractor/ExcelExtractor.html