Text extract omits run text where the run contains a rsidDel attribute. This is incorrect as rsid* attributes are simply revision session identifiers. It is possible for this attribute to be present, but the run text still be valid. Instead of the revision session id attributes, text extract should key on specific revision tags to determine which text to omit. The appropriate tag to omit is <delText>
This issue was introduced by Bug #58067
Created attachment 35540 [details] A .docx with rsidDel attributes
Adjusted this with r1819405 as follows: * Instead of rsiddel check for deltext to exclude deleted content * Also add runs from insertions in trackchanges to add inserted text correctly Hopefully this now makes it work better across the various ways documents can contain text-content.