Bug 51772 - IllegalArgumentException Parsing MS Word 97 - 2003
Summary: IllegalArgumentException Parsing MS Word 97 - 2003
Status: RESOLVED FIXED
Alias: None
Product: POI
Classification: Unclassified
Component: HWPF (show other bugs)
Version: 3.8-dev
Hardware: PC All
: P2 major (vote)
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2011-09-06 13:14 UTC by pqueixalos
Modified: 2011-09-07 12:13 UTC (History)
0 users



Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description pqueixalos 2011-09-06 13:14:28 UTC
Example file throwing IllegalArgumentException :
http://www.ac-nancy-metz.fr/enseign/physique/nouvcoll/4-matiere/Exemple%20s%C3%A9ance%20TIC%20et%20Prisme.doc

Caused by: java.lang.IllegalArgumentException: charStart (3102) > charEnd (3091)
at org.apache.poi.hwpf.model.BytePropertyNode.<init>(BytePropertyNode.java:61)
at org.apache.poi.hwpf.model.CHPX.<init>(CHPX.java:53)
at org.apache.poi.hwpf.model.CHPFormattedDiskPage.<init>(CHPFormattedDiskPage.java:91)
at org.apache.poi.hwpf.model.CHPBinTable.<init>(CHPBinTable.java:101)
at org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:280)
at org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.java:67)
at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:196)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
... 41 more
Comment 1 Sergey Vladimirov 2011-09-07 12:13:35 UTC
Fixed in r1166144;

Thanks for reporting this bug. It turns out we made some assumptions about Word files that were incorrect. You bug helped us to fix it.