Useful utility to dump the structure of a PPT file into XML format.
Created attachment 15604 [details] sources for PPDump utility
Created attachment 15605 [details] Sources for PPDump utility
Looks quite interesting, probably easier to work with than the plain text dumpers from hslf.dev :) However, it doesn't work for files not in the current directory. Might also be nice to have it dump to standard out by default, and only to a file if one is given. Couple of things that would be nice: * report the type and length in hex also (makes it easier to find them in the file) * report the byte offset to the start of the record (hex and decimal)
Created attachment 15644 [details] Improved PPDump New version of PPDump. Improvements (Thanks to Nick Burch for suggestions): - moved to org.apache.poi.hslf.dev - print record header in hex format as an XML attribute - print offset to the record - if ppt contains pictures print short info for each picture - default output is to System.out
Great, thanks for that. I've renamed it to PPTXMLDump (to make it clearer what it does), and applied to CVS