|Summary:||poi-3.5-beta1-20080718.jar - content from the comments of an Office 2003 ppt document is not extracted.|
|Component:||POI Overall||Assignee:||POI Developers List <dev>|
|OS:||Windows Server 2003|
|Attachments:||Contains JUnit test class and documents used for testing.|
Description xtrim 2008-08-04 09:08:17 UTC
Created attachment 22367: Contains JUnit test class and documents used for testing.

The text contained in the comments of a PowerPoint 2003 document is not extracted. The attachment contains the JUnit test class and the documents used for testing. We expected to extract the word "testdoc".

Notes on the attached documents:
- The document "TextInComment.ppt" contains the word "testdoc" as a comment.
- "TestUnitPoi35Filter.java" is the JUnit class.
Comment 1 Nick Burch 2008-08-04 15:48:44 UTC
Support for this is now in svn trunk. You will need to set a flag on your PowerPoint extractor to turn on comment extraction though, as it's off by default.
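
For readers trying this out, here is a minimal sketch of what turning the flag on might look like. The comment above does not name the flag; this assumes it is `setCommentsByDefault(boolean)` on `org.apache.poi.hslf.extractor.PowerPointExtractor`, and that a POI build containing the fix is on the classpath. The class name `CommentExtractionSketch` is made up for illustration; the file name is the test document from the report.

```java
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;

import org.apache.poi.hslf.extractor.PowerPointExtractor;

public class CommentExtractionSketch {
    public static void main(String[] args) throws IOException {
        // "TextInComment.ppt" is the test document named in the report.
        InputStream in = new FileInputStream("TextInComment.ppt");
        try {
            PowerPointExtractor extractor = new PowerPointExtractor(in);

            // Assumption: this is the flag the comment refers to.
            // Comment extraction is off by default, so enable it explicitly.
            extractor.setCommentsByDefault(true);

            String text = extractor.getText();

            // Reports whether the comment text from the report was extracted.
            System.out.println("Contains \"testdoc\": " + text.contains("testdoc"));
        } finally {
            in.close();
        }
    }
}
```

If the flag has a different name in your POI version, the Javadoc for the PowerPoint extractor class should list the available setters.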