The NetDocuments OCR process produces an MSG attachment that contains the text-only result of an OCR scan of the attachment. The NetDocuments Frida service (a Java process) updates MSG metadata, i.e. extension and size, in the MSG header. Apache POI already had the ability to write datatypes, but our developers added methods to the POI HSMF module to properly write the header necessary for an MSG attachment. So this patch to POI HSMF provides the functionality to write attributes to the MSG header.
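For anyone following along, here is a rough sketch of what one fixed-length entry in an MSG properties stream looks like on disk, using the attachment size as the example. This is not the code from the patch and it does not call POI at all. The class and method names (`MsgPropertyEntrySketch`, `encodeInt32Property`) are made up for illustration, and the 16-byte layout (tag, flags, value padded to 8 bytes), the property ID `0x0E20` and the flag values are my reading of the MS-OXMSG and MS-OXPROPS documents, so please check them against the specs before relying on them.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical illustration only; not part of POI or of the attached patch.
final class MsgPropertyEntrySketch {
    // Assumed from MS-OXPROPS: PidTagAttachSize has ID 0x0E20 and type PtypInteger32 (0x0003).
    static final int ATTACH_SIZE_ID = 0x0E20;
    static final int TYPE_INTEGER32 = 0x0003;
    // Assumed from MS-OXMSG: per-property flag bits.
    static final int FLAG_READABLE = 0x2;
    static final int FLAG_WRITABLE = 0x4;

    // Encodes one 16-byte fixed-length entry: 4-byte tag, 4-byte flags, 8-byte value slot.
    static byte[] encodeInt32Property(int propertyId, int flags, int value) {
        ByteBuffer buf = ByteBuffer.allocate(16).order(ByteOrder.LITTLE_ENDIAN);
        buf.putShort((short) TYPE_INTEGER32); // low 16 bits of the tag: property type
        buf.putShort((short) propertyId);     // high 16 bits of the tag: property ID
        buf.putInt(flags);
        buf.putInt(value);                    // 32-bit value
        buf.putInt(0);                        // padding up to the 8-byte value slot
        return buf.array();
    }

    // Formats bytes as space-separated upper-case hex for inspection.
    static String toHex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            sb.append(String.format("%02X ", b));
        }
        return sb.toString().trim();
    }

    public static void main(String[] args) {
        byte[] entry = encodeInt32Property(ATTACH_SIZE_ID, FLAG_READABLE | FLAG_WRITABLE, 1234);
        System.out.println(toHex(entry));
    }
}
```

Variable-length properties such as the extension string are stored differently (the value lives in its own stream), so this sketch only covers the fixed-length case.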
Can we get the patch attachment when you are ready? Or better still, a GitHub PR.
Created attachment 38408: POI-Scratchpad HSMF patch
Hi Lyn - this needs a test - it can't be merged without one.
Created attachment 38421: POI-Scratchpad HSMF patch #2
Thanks. Added in r1904685. I made a couple of small changes, if you want to review them.
(In reply to PJ Fanning from comment #5)
> thanks - added r1904685 - I made a couple of small changes if you want to
> review them

Looks great. Is there a way to amend the commit message and include Yurii Shyman and Rick Cowdell as the developers?
I'm afraid that changing the commit message is not really feasible.
(In reply to PJ Fanning from comment #7)
> I'm afraid that changing the commit message is not really feasible.

Understood.
P.J., can you provide details on the remaining steps for this POI patch? I see that the patch has been merged into `trunk` on GitHub. Is there a calendar documenting when the next POI release is scheduled? How and when does a bug get closed?
There are currently no plans for a formal POI release. Releases occur approximately every 6 months and take quite a lot of volunteer effort. You can build your own jars if you need the latest fixes. One page to keep an eye on is https://poi.apache.org/changes.html