Bug 55935 - Inner table rendered outside
Summary: Inner table rendered outside
Status: NEW
Alias: None
Product: POI
Classification: Unclassified
Component: HWPF (show other bugs)
Version: unspecified
Hardware: PC All
: P2 normal (vote)
Target Milestone: ---
Assignee: POI Developers List
Depends on:
Reported: 2013-12-26 11:02 UTC by Hrishi
Modified: 2014-01-02 12:09 UTC (History)
0 users

Original doc file (26.50 KB, application/msword)
2013-12-26 11:02 UTC, Hrishi
Output html file (1.41 KB, text/html)
2013-12-26 11:07 UTC, Hrishi

Note You need to log in before you can comment on or make changes to this bug.
Description Hrishi 2013-12-26 11:02:59 UTC
Created attachment 31156 [details]
Original doc file

I am using POI 3.9 version to convert word document to html.

While working with table, it should not process properly.

In a table, if a first row of first cell contains a table it will be rendered as separate table.
Comment 1 Hrishi 2013-12-26 11:07:35 UTC
Created attachment 31157 [details]
Output html file
Comment 2 Andreas Beeker 2013-12-26 17:51:02 UTC
The AbstractWordConverter/WordToHtmlConverter assumes, that nested tables are listed after their outer tables, but in your example, the nested tables is processed first and the outer table doesn't know of its children.
I'm not sure, if this is just a problem of the converter, or if the record parsing /linking is wrong.

I haven't read the word spec up till now and apparently there's a lack of hwpf maintainer (http://poi.apache.org/hwpf/#HWPF+Pointman+Needed%21) ... can you use XWPF (.docx) instead? (... as the xml structure is probably more straight-forward ...)
Comment 3 Hrishi 2014-01-02 11:14:19 UTC
How can i use the xwpf? because i want to convert .doc files. How can i read *.doc files using xwpf instead xwpf?
Comment 4 Hrishi 2014-01-02 12:09:57 UTC
Also it happens only when i have created the nested table in first row , first cell of the table. otherwise it has rendered as it is in my word file. The only problem is in the table[0,0]