Apache OpenOffice (AOO) Bugzilla – Issue 125035
ooxml: Create a new framework for the parsing of OOXML documents.
Last modified: 2023-01-06 10:32:04 UTC
Create a framework that can be used by all applications to import documents in the OfficeOpenXML (OOXML) format. See https://wiki.openoffice.org/wiki/Ooxml for more information.
"af" committed SVN revision 1599430 into trunk: 125035: Initial commit for the new OOXML framework.
"af" committed SVN revision 1601582 into trunk: 125035: Can now create a validating parser.
Can now create a validating parser (complex types). The ValidatingCreator creates one NFA for each complex type and one for the top level elements. These are then converted to DFAs and minimized. At the moment this results in 6097 states and 34286 transitions.
"af" committed SVN revision 1602077 into trunk: 125035: Added support for attributes.
"af" committed SVN revision 1602118 into trunk: 125035: Added support for actions to the experimental Java parser.
"af" committed SVN revision 1603154 into trunk: 125035: Added support for attribute types (simple types).
"af" committed SVN revision 1604190 into trunk: 125035: Added support for simple types.
In MSO 2007 and above there are two file extensions for files that support OOXML: .docx and .docm. As far as I know macro support is the only difference between them. .docm is macro enabled document and .docx is not. For me it is important to know that a document has no macro code inside because nowadays macro enabled documents are used to spread malware https://nakedsecurity.sophos.com/2015/09/28/why-word-macro-malware-is-back-and-what-you-can-do-about-it/ If you at last decide to implement docx save/export support in AOO do not forget about 'macro disabled' document format. P.S. Is there any ODF based file format I can use to store documents and be sure that there is no macro code inside them?
What is the status of this new OOXML framework? Has it been abandoned?