Bug 62402 - XWPFWordExtractor : Incorrect order of text extracted in document(with floating tables)
Summary: XWPFWordExtractor : Incorrect order of text extracted in document(with floati...
Status: NEW
Alias: None
Product: POI
Classification: Unclassified
Component: XWPF (show other bugs)
Version: 3.17-FINAL
Hardware: PC All
: P2 major (vote)
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-05-23 13:38 UTC by Madhav
Modified: 2018-05-23 13:40 UTC (History)
0 users



Attachments
test document for bug (13.70 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2018-05-23 13:38 UTC, Madhav
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Madhav 2018-05-23 13:38:48 UTC
Created attachment 35946 [details]
test document for bug

Hi,
I am using apache poi for docx(xwpf) content extraction.

 I extracted documents content in order iterating through List<IBodyElements>, but it is not the same as that of viewed in ms-word application. 

Also XWPFWordExtractor renders in the same order i.e, incorrect(reverse)

In the text document attached below, there is heading which is followed by table.
But its rendered in reverse order by above mentioned method.

I further debugged and came to know it only occurred with floating table(in my case).

Finally my question is that does poi handle floating tables order in the document.


Thanks in advance.