Bug 60942

Summary: Avoid unicode check in Word 6.0 docs
Product: POI Reporter: Tim Allison <tallison>
Component: HWPFAssignee: POI Developers List <dev>
Severity: enhancement    
Priority: P2    
Version: 3.16-dev   
Target Milestone: ---   
Hardware: PC   
OS: All   

Description Tim Allison 2017-03-30 16:11:06 UTC
This is a half step towards 60936.  

"On TIKA-2313, Steven Hall submitted an example Word 6.0 file whose extracted text is garbage."

From what I can tell 6.0 didn't use Unicode.  Until we can figure out how the codepage was specified in 6.0, we should at least turn off the Unicode check.
Comment 1 Tim Allison 2017-03-31 18:37:44 UTC

*** This bug has been marked as a duplicate of bug 50955 ***