Bug 47875 - Paragraph count is incorrect when reading a Word document written in Chinese
Summary: Paragraph count is incorrect when reading a Word document written in Chinese
Status: RESOLVED WORKSFORME
Alias: None
Product: POI
Classification: Unclassified
Component: HWPF
Version: 3.2-FINAL
Hardware: PC Windows XP
Importance: P2 normal
Target Milestone: ---
Assignee: POI Developers List
Reported: 2009-09-18 23:46 UTC by inthendsun
Modified: 2016-02-14 18:40 UTC
Description inthendsun 2009-09-18 23:46:15 UTC
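import java.io.FileInputStream;
import org.apache.poi.hwpf.extractor.WordExtractor;

// Open the .doc file and extract its paragraphs with HWPF
FileInputStream fileIn = new FileInputStream("D:\\111.doc");
WordExtractor extractor = new WordExtractor(fileIn);
String[] paras = extractor.getParagraphText();
// Print the number of paragraphs that were extracted
System.out.println(paras.length);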


Why is the paragraph count incorrect? Reading a document written in English appears to work without problems, but my Word document is written in Chinese.

Thanks!
Comment 1 Yegor Kozlov 2011-06-25 12:38:04 UTC
Please attach the problematic file; without it we can't do much to help you.

Yegor
Comment 2 Dominik Stadler 2016-02-14 18:40:34 UTC
There has been no update for a long time, so I am closing this. Please reopen with more information if this is still a problem for you.