Bug 47613 - Content Extraction fails for embedded Excel/Word docs in PPT.
Summary: Content Extraction fails for embedded Excel/Word docs in PPT.
Alias: None
Product: POI
Classification: Unclassified
Component: HSLF (show other bugs)
Version: 3.2-FINAL
Hardware: PC Windows XP
: P2 critical (vote)
Target Milestone: ---
Assignee: POI Developers List
Depends on:
Reported: 2009-07-31 02:03 UTC by Bijju
Modified: 2009-08-03 03:02 UTC (History)
0 users


Note You need to log in before you can comment on or make changes to this bug.
Description Bijju 2009-07-31 02:03:29 UTC

I am not able to extract embedded document information of PPT. PPT has been embedded with simple excel and word documents, but those events are not raised.

When i checked with ppt/excel/word files embedded in word(or)excel those content extraction is successful.

Only with PPT there is problem. There are no events raised by the Reader for embedded objects.

Could you pls. suggest the reason for this ASAP?
Comment 1 Nick Burch 2009-08-03 03:02:55 UTC
Powerpoint doesn't embed documents as POIFS (ole2) substreams. It stores them in its own records

See http://poi.apache.org/poifs/embeded.html