ExtractorFactory.createExtractor(pkg) doesn't work for macro-enabled Word documents (.docm) with the MIME type "application/vnd.ms-word.document.macroEnabled.12".

Exception:
java.lang.IllegalArgumentException: No supported documents found in the OOXML package

This is not urgent, because extraction works when the XWPFWordExtractor class is used directly, but it would be nice if this could be fixed.

Best regards,
Khalid
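For illustration, here is a minimal, self-contained sketch of how a factory that picks an extractor by content type can produce this kind of failure. It is not POI's actual implementation: the class ContentTypeDispatch and its methods are hypothetical, and the lookup table is an assumption about how such a factory might be organised. The two content-type strings are the ones OOXML uses for the main document part of a .docx and a .docm package. Note that the package-level MIME type quoted above ("...macroEnabled.12") differs from the content type of the main part inside the package ("...macroEnabled.main+xml").

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: NOT POI's ExtractorFactory, only an illustration
// of content-type based dispatch and how an unregistered type fails.
public class ContentTypeDispatch {
    // Content type of the main document part in a .docx package.
    static final String DOCX_MAIN =
        "application/vnd.openxmlformats-officedocument.wordprocessingml.document.main+xml";
    // Content type of the main document part in a macro-enabled .docm package.
    static final String DOCM_MAIN =
        "application/vnd.ms-word.document.macroEnabled.main+xml";

    // Builds the lookup table from content type to extractor name.
    // With includeMacroEnabled == false the .docm type is unknown,
    // which mimics the situation described in this report.
    static Map<String, String> registry(boolean includeMacroEnabled) {
        Map<String, String> known = new LinkedHashMap<>();
        known.put(DOCX_MAIN, "XWPFWordExtractor");
        if (includeMacroEnabled) {
            known.put(DOCM_MAIN, "XWPFWordExtractor");
        }
        return known;
    }

    // Returns the extractor name registered for the given content type,
    // or throws if the content type is not registered.
    static String pickExtractor(Map<String, String> known, String contentType) {
        String extractor = known.get(contentType);
        if (extractor == null) {
            throw new IllegalArgumentException(
                "No supported documents found in the OOXML package");
        }
        return extractor;
    }

    public static void main(String[] args) {
        // Registry that also knows the macro-enabled content type.
        System.out.println(pickExtractor(registry(true), DOCM_MAIN));
        try {
            // Registry without the macro-enabled content type.
            pickExtractor(registry(false), DOCM_MAIN);
        } catch (IllegalArgumentException e) {
            System.out.println("Failed: " + e.getMessage());
        }
    }
}
```

Under these assumptions, both content types map to the same extractor because the document body has the same structure in .docx and .docm files, which would also explain why using XWPFWordExtractor directly works.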
Any chance you could upload a sample .docm file? We'll want one so we can write unit tests to ensure things work after the fix and don't get broken again later on.
Created attachment 23706: Common docm file (Word 2007 macro-enabled document) that causes the error described
The bug appears in apache-poi-3.5-beta5. The file that causes the error has been uploaded as an attachment.
I've just added a unit test for this, and it all passes just fine. I think this bug has already been fixed.