Bug 55935

Summary: Inner table rendered outside
Product: POI Reporter: Hrishi <aswiniragunath>
Component: HWPFAssignee: POI Developers List <dev>
Status: NEW ---    
Severity: normal    
Priority: P2    
Version: unspecified   
Target Milestone: ---   
Hardware: PC   
OS: All   
Attachments: Original doc file
Output html file

Description Hrishi 2013-12-26 11:02:59 UTC
Created attachment 31156 [details]
Original doc file

I am using POI 3.9 version to convert word document to html.

While working with table, it should not process properly.

In a table, if a first row of first cell contains a table it will be rendered as separate table.
Comment 1 Hrishi 2013-12-26 11:07:35 UTC
Created attachment 31157 [details]
Output html file
Comment 2 Andreas Beeker 2013-12-26 17:51:02 UTC
The AbstractWordConverter/WordToHtmlConverter assumes, that nested tables are listed after their outer tables, but in your example, the nested tables is processed first and the outer table doesn't know of its children.
I'm not sure, if this is just a problem of the converter, or if the record parsing /linking is wrong.

I haven't read the word spec up till now and apparently there's a lack of hwpf maintainer (http://poi.apache.org/hwpf/#HWPF+Pointman+Needed%21) ... can you use XWPF (.docx) instead? (... as the xml structure is probably more straight-forward ...)
Comment 3 Hrishi 2014-01-02 11:14:19 UTC
How can i use the xwpf? because i want to convert .doc files. How can i read *.doc files using xwpf instead xwpf?
Comment 4 Hrishi 2014-01-02 12:09:57 UTC
Also it happens only when i have created the nested table in first row , first cell of the table. otherwise it has rendered as it is in my word file. The only problem is in the table[0,0]