Created attachment 32960 [details] The master mail with attachment As describe in this bug : https://bz.apache.org/bugzilla/show_bug.cgi?id=58211 I have an Outlook message (Test mail attachment) in attachment of another Outlook message (Master mail.msg). The msg file saved using POI (Test mail attachment_from POI.msg) generate an error when I try to open it in Outlook. Outlook does not recognize it. I join the attachment msg file saved by outlook (Test mail attachment_from outlook.msg). I also join my code in case. (ExtractMsg.java)
Created attachment 32961 [details] Embedded outlook msg extracted by POI
Created attachment 32962 [details] Embedded outlook msg saved by Outlook
Created attachment 32963 [details] My simple java code
Thanks for those Any chance you could try running dev tools like org.apache.poi.poifs.dev.POIFSLister and org.apache.poi.hsmf.dev.HSMFDump against the two extracted files, and see if there are any obvious differences between them? Sections in one not the other, different IDs, that sort of thing. That should help us narrow in on what to change
Created attachment 32966 [details] Attachment from outlook, POIFSLister result
Created attachment 32967 [details] Attachment from POI, POIFSLister result
Created attachment 32968 [details] Attachment from outlook, HSMFDump result
Created attachment 32969 [details] Attachment from POI, HSMFDump result
There are differences between two extracted files using both tools. Hard for me to interpret differences.
Hi, do you have any update about this issue ?
If you're able to identify the differences, we might be able to make a quick fix, or otherwise guide you on making the fix yourself Otherwise, this issue will remain until someone volunteers to spend their free time looking at it, be they a committer or just someone else interested from within the community
I see. Unfortunately I'm not able to identify the differences yet. I will hope for a interested volunteers. Thanks.
Not that this is any consolation/help, but it looks like POI (at least via Tika) is able to read the contents of the embedded document, both from the outer container document and from the version that you attached as extracted by POI.