Bug 47735 - Word Extractor Does not Extract Entire File
Summary: Word Extractor Does not Extract Entire File
Status: RESOLVED FIXED
Alias: None
Product: POI
Classification: Unclassified
Component: HWPF
Version: 3.5-dev
Hardware: HP Linux
Importance: P2 major
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-08-25 06:23 UTC by sana.ahmed
Modified: 2011-06-20 16:41 UTC

Attachments
This is the file that does not get extracted completely. (125.50 KB, application/octet-stream)
2009-08-25 06:23 UTC, sana.ahmed

Description sana.ahmed 2009-08-25 06:23:01 UTC
Created attachment 24164
This is the file that does not get extracted completely.

We are using POI 3.5 beta 6 in production to extract text from Office
documents. We came across a Word document that is not extracted completely:
the extracted text is missing a couple of paragraphs from the middle of the
document.
Comment 1 Yegor Kozlov 2011-06-20 16:41:49 UTC
Word Extractor from 3.8-beta3 correctly extracts all text from the attached document.
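
A quick way to confirm a fix like this is to compare the extractor output against a few paragraphs known to be in the document. The following is only a minimal sketch of such a check, not part of POI: the class name ExtractionCheck, the helper missingParagraphs, and the sample strings are all hypothetical, and the extracted text is assumed to come from the extractor under test (for example WordExtractor.getText() in HWPF).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of POI: reports expected paragraphs
// that are absent from the text an extractor returned.
public class ExtractionCheck {

    // Collapse runs of whitespace so line-wrapping differences do not matter.
    static String normalize(String s) {
        return s.replaceAll("\\s+", " ").trim();
    }

    // Return the expected paragraphs that do not appear in the extracted text.
    static List<String> missingParagraphs(List<String> expected, String extracted) {
        String haystack = normalize(extracted);
        List<String> missing = new ArrayList<>();
        for (String paragraph : expected) {
            String needle = normalize(paragraph);
            if (!needle.isEmpty() && !haystack.contains(needle)) {
                missing.add(paragraph);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Made-up sample data; in practice `extracted` would be the string
        // returned by the extractor being tested.
        List<String> expected = Arrays.asList(
                "First paragraph.", "Middle paragraph.", "Last paragraph.");
        String extracted = "First paragraph.\r\nLast paragraph.\r\n";
        System.out.println(missingParagraphs(expected, extracted));
    }
}
```

With the sample data above, the check reports the middle paragraph as missing; an empty list would indicate that every expected paragraph was found.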