Bug 58214 - Error with outlook while opening an msg files extracted by POI from another msg file
Summary: Error with outlook while opening an msg files extracted by POI from another m...
Status: NEEDINFO
Alias: None
Product: POI
Classification: Unclassified
Component: HSMF (show other bugs)
Version: 3.13-dev
Hardware: PC All
: P2 normal (vote)
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-08-05 12:56 UTC by Alexandre
Modified: 2015-09-02 12:19 UTC (History)
0 users



Attachments
The master mail with attachment (49.50 KB, application/x-ole-storage)
2015-08-05 12:56 UTC, Alexandre
Details
Embedded outlook msg extracted by POI (20.50 KB, application/x-ole-storage)
2015-08-05 12:57 UTC, Alexandre
Details
Embedded outlook msg saved by Outlook (22.50 KB, application/x-ole-storage)
2015-08-05 12:58 UTC, Alexandre
Details
My simple java code (1.12 KB, text/plain)
2015-08-05 12:58 UTC, Alexandre
Details
Attachment from outlook, POIFSLister result (1.50 KB, application/octet-stream)
2015-08-05 14:53 UTC, Alexandre
Details
Attachment from POI, POIFSLister result (1.21 KB, application/octet-stream)
2015-08-05 14:54 UTC, Alexandre
Details
Attachment from outlook, HSMFDump result (12.88 KB, application/octet-stream)
2015-08-05 14:54 UTC, Alexandre
Details
Attachment from POI, HSMFDump result (6.75 KB, application/octet-stream)
2015-08-05 14:55 UTC, Alexandre
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Alexandre 2015-08-05 12:56:22 UTC
Created attachment 32960 [details]
The master mail with attachment

As describe in this bug : https://bz.apache.org/bugzilla/show_bug.cgi?id=58211 

I have an Outlook message (Test mail attachment) in attachment of another Outlook message (Master mail.msg). 
The msg file saved using POI (Test mail attachment_from POI.msg) generate an error when I try to open it in Outlook. Outlook does not recognize it.

I join the attachment msg file saved by outlook (Test mail attachment_from outlook.msg). 

I also join my code in case. (ExtractMsg.java)
Comment 1 Alexandre 2015-08-05 12:57:14 UTC
Created attachment 32961 [details]
Embedded outlook msg extracted by POI
Comment 2 Alexandre 2015-08-05 12:58:33 UTC
Created attachment 32962 [details]
Embedded outlook msg saved by Outlook
Comment 3 Alexandre 2015-08-05 12:58:58 UTC
Created attachment 32963 [details]
My simple java code
Comment 4 Nick Burch 2015-08-05 14:35:57 UTC
Thanks for those

Any chance you could try running dev tools like org.apache.poi.poifs.dev.POIFSLister and org.apache.poi.hsmf.dev.HSMFDump against the two extracted files, and see if there are any obvious differences between them? Sections in one not the other, different IDs, that sort of thing. That should help us narrow in on what to change
Comment 5 Alexandre 2015-08-05 14:53:46 UTC
Created attachment 32966 [details]
Attachment from outlook, POIFSLister result
Comment 6 Alexandre 2015-08-05 14:54:15 UTC
Created attachment 32967 [details]
Attachment from POI, POIFSLister result
Comment 7 Alexandre 2015-08-05 14:54:53 UTC
Created attachment 32968 [details]
Attachment from outlook, HSMFDump  result
Comment 8 Alexandre 2015-08-05 14:55:13 UTC
Created attachment 32969 [details]
Attachment from POI, HSMFDump  result
Comment 9 Alexandre 2015-08-05 14:57:45 UTC
There are differences between two extracted files using both tools. 

Hard for me to interpret differences.
Comment 10 Alexandre 2015-08-31 14:39:53 UTC
Hi, 

do you have any update about this issue ?
Comment 11 Nick Burch 2015-09-01 10:54:03 UTC
If you're able to identify the differences, we might be able to make a quick fix, or otherwise guide you on making the fix yourself

Otherwise, this issue will remain until someone volunteers to spend their free time looking at it, be they a committer or just someone else interested from within the community
Comment 12 Alexandre 2015-09-02 07:44:26 UTC
I see. Unfortunately  I'm not able to identify the differences yet. I will hope for a interested volunteers.
Thanks.
Comment 13 Tim Allison 2015-09-02 12:19:02 UTC
Not that this is any consolation/help, but it looks like POI (at least via Tika) is able to read the contents of the embedded document, both from the outer container document and from the version that you attached as extracted by POI.