Bug 45535 - poi-3.5-beta1-20080718.jar - content from an embedded document of an Office 2003 doc document is not extracted.
Summary: poi-3.5-beta1-20080718.jar - content from an embedded document of an Office 2...
Status: RESOLVED WONTFIX
Alias: None
Product: POI
Classification: Unclassified
Component: POI Overall (show other bugs)
Version: unspecified
Hardware: PC Windows Server 2003
: P2 normal (vote)
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2008-08-04 07:26 UTC by xtrim
Modified: 2008-08-04 07:43 UTC (History)
0 users



Attachments
Contains JUnit test class and documents used for testing. (51.49 KB, application/x-zip-compressed)
2008-08-04 07:26 UTC, xtrim
Details
The attachments contains new examples of documents used for testing. (58.05 KB, application/x-zip-compressed)
2008-08-04 07:43 UTC, xtrim
Details

Note You need to log in before you can comment on or make changes to this bug.
Description xtrim 2008-08-04 07:26:25 UTC
Created attachment 22358 [details]
Contains JUnit test class and documents used for testing.

When an office 2003 word document contains an embedded document, the content of this embedded document is not extracted.
Find in attachments the JUnit test class and the documents used for testing.

We expected to extract the words "testdoc" and "test phrase".


notes on the attached document:

- the document "classic_EmbeddedObject_excel.doc" contains an xls embedded document that contains the words "testdoc" and "test phrase"

- the document "classic_EmbeddedObject_ppt.doc" contains a ppt embedded document that contains the words "testdoc" and "test phrase"

- the document "classic_EmbeddedObject_word.doc" contains a doc embedded document that contains the words "testdoc" and "test phrase"

"TestUnitPoi35Filter.java" is the JUnit class.
Comment 1 Nick Burch 2008-08-04 07:33:07 UTC
You'll need to pass the embeded OLE2 documents off to the appropriate h##f code to process them.

POI doesn't automatically re-curse into embeded documents.
Comment 2 xtrim 2008-08-04 07:43:21 UTC
Created attachment 22359 [details]
The attachments contains new examples of documents used for testing.