Bug 45534 - poi-3.5-beta1-20080718.jar - content from a linked document of an Office 2003 doc is not extracted.
Status: RESOLVED WONTFIX
Alias: None
Product: POI
Classification: Unclassified
Component: POI Overall
Version: unspecified
Hardware: PC Windows Server 2003
Importance: P2 normal
Target Milestone: ---
Assignee: POI Developers List
Reported: 2008-08-04 06:56 UTC by xtrim
Modified: 2008-08-04 07:57 UTC

Attachments
Contains JUnit test class and documents used for testing. (89.39 KB, application/x-zip-compressed)
2008-08-04 07:08 UTC, xtrim
New examples of documents used for testing. (103.98 KB, application/x-zip-compressed)
2008-08-04 07:54 UTC, xtrim

Description xtrim 2008-08-04 06:56:20 UTC
When an Office 2003 document contains a link to another document, the content of the linked document is not extracted.
The attachments contain the JUnit test class and the documents used for testing.
We expected the word "testdoc" to be extracted.

Notes on the attached documents:

- the document "classic_ContentLinkedObject_excel.doc" contains a hyperlink to the xls document "TestLinkedObject1.xls", which contains the word "testdoc".

- the document "classic_ContentLinkedObject_ppt.doc" contains a hyperlink to the ppt document "TestLinkedObject1.ppt", which contains the word "testdoc".

- the document "classic_ContentLinkedObject_word.doc" contains a hyperlink to the doc document "TestLinkedObject1.doc", which contains the word "testdoc".
Comment 1 xtrim 2008-08-04 07:08:55 UTC
Created attachment 22356
Contains JUnit test class and documents used for testing.

The attachment is a ZIP file.
Comment 2 xtrim 2008-08-04 07:54:48 UTC
Created attachment 22360
New examples of documents used for testing.

The attachment is a ZIP file.

- the document "form_ContentLinkedObject_excel.doc" contains a hyperlink to the xls document "TestLinkedObject1.xls", which contains the word "testdoc".

- the document "form_ContentLinkedObject_ppt.doc" contains a hyperlink to the ppt document "TestLinkedObject1.ppt", which contains the word "testdoc".

- the document "form_ContentLinkedObject_word.doc" contains a hyperlink to the doc document "TestLinkedObject1.doc", which contains the word "testdoc".
Comment 3 Nick Burch 2008-08-04 07:57:29 UTC
You'll need to pass the linked OLE2 documents off to the appropriate h##f code
(HSSF for .xls, HSLF for .ppt, HWPF for .doc) to process them.

POI doesn't automatically recurse into linked documents.
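
For anyone landing here later, the hand-off described in this comment could be sketched roughly as below. This is only an illustration under stated assumptions, not POI code: the class LinkedDocumentDispatcher and its methods are hypothetical names, the caller is assumed to have already collected the file names of the linked documents, and the per-format extractors are passed in as plain functions. In POI 3.x those functions would typically wrap the HSSF, HSLF, and HWPF text extractors (for example ExcelExtractor, PowerPointExtractor, and WordExtractor); those calls are left out so the sketch compiles without POI on the classpath.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.function.Function;

// Hypothetical helper (not part of POI): routes each linked file to the
// extractor registered for its format, since POI will not follow links itself.
final class LinkedDocumentDispatcher {

    // Lower-case file extension -> function that returns the file's text.
    private final Map<String, Function<String, String>> extractors = new HashMap<>();

    // Registers a text extractor for one extension, e.g. "xls".
    void register(String extension, Function<String, String> extractor) {
        extractors.put(extension.toLowerCase(Locale.ROOT), extractor);
    }

    // Returns the lower-case extension of a file name, or "" if it has none.
    static String extensionOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        return dot < 0 ? "" : fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
    }

    // Names the POI component that handles a given OLE2 file type,
    // or null when the extension is not one of the three covered here.
    static String componentFor(String fileName) {
        switch (extensionOf(fileName)) {
            case "xls": return "HSSF";
            case "ppt": return "HSLF";
            case "doc": return "HWPF";
            default:    return null;
        }
    }

    // Extracts text from every linked file that has a registered extractor;
    // files with unknown extensions are skipped.
    List<String> extractAll(List<String> linkedFiles) {
        List<String> texts = new ArrayList<>();
        for (String file : linkedFiles) {
            Function<String, String> extractor = extractors.get(extensionOf(file));
            if (extractor != null) {
                texts.add(extractor.apply(file));
            }
        }
        return texts;
    }
}
```

The calling code would register one function per format, then pass in the link targets it found in the parent document (here, "TestLinkedObject1.xls", "TestLinkedObject1.ppt", and "TestLinkedObject1.doc") and search the returned text for "testdoc". If the linked documents can themselves contain links, the caller has to repeat the process on each of them, because the recursion is its responsibility and not POI's.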