HSLF doesn't have VBAInfo or VBAInfoAtom classes, or classes to look inside the external ole compressed container.
See http://poi.apache.org/poifs/embeded.html#Files+embedded+in+PowerPoint - PPT files are "special" (different team at Microsoft wrote it?), and embedded resources go in special records rather than at the OLE2 level... I suspect you'll need to do something similar to Tika's HSLFExtractor's handleSlideEmbeddedResources method
Thanks for the info. I'll try to figure this out. LibreOffice 5 can't find the VBA Project in test-data/slideshow/SimpleMacro.ppt, which I created with PowerPoint 2007.
Format information is available in: [MS-PPT]: PowerPoint (.ppt) Binary File Format Open Specifications Documentation [1] [1] https://msdn.microsoft.com/en-us/library/office/cc313106%28v=office.12%29.aspx
Created attachment 33754 [details] just a quick hack ... This is just a quick hack ... to get you started.
*** Bug 60162 has been marked as a duplicate of this bug. ***
r1765696 I made a few small changes (thanks to TestReWrite for finding one small bug!), but I basically left Andi's patch as is. This will offer at least basic access to VBAMacros. I've tried to add info in the javadocs about how ppt is not supported by VBAMacroReader directly. We can work on integrating macro extraction across the file types in another issue. For now, advanced users have what they need.