Bug 63575

Summary: XWPFWordExtractor - capitalized text (<w:caps/>)
Product: POI Reporter: Franz Seidl <website>
Component: XWPFAssignee: POI Developers List <dev>
Status: NEW ---    
Severity: normal CC: website
Priority: P2    
Version: unspecified   
Target Milestone: ---   
Hardware: PC   
OS: Linux   
Attachments: Example

Description Franz Seidl 2019-07-21 09:58:32 UTC
Created attachment 36670 [details]

XWPFWordExtractor doesn't respect text which is formatted capitalized (<w:caps/>).

See attached example:
  - WordTextExtractorDocx.java: test program
  - capitalized.docx: test file
  - capitalized.txt: "text only" version saved with Word

I expect the text: "The following word is: CAPITALIZED."
Instead I get: "The following word is: capitalized."
Comment 1 Franz Seidl 2019-07-21 10:04:33 UTC
Similar to bug Bug 63576