Bug 45595

Summary: poi-3.5-beta1-20080718.jar - the WordArt text in a 2003 doc document is not extracted.
Product: POI Reporter: xtrim <grizolle_benedicte>
Component: POI OverallAssignee: POI Developers List <dev>
Status: RESOLVED WONTFIX    
Severity: normal    
Priority: P2    
Version: unspecified   
Target Milestone: ---   
Hardware: PC   
OS: Windows Server 2003   
Attachments: Contains JUnit test class and documents used for testing.

Description xtrim 2008-08-08 03:22:38 UTC
Created attachment 22411 [details]
Contains JUnit test class and documents used for testing.

The text formated with "WordArt" (= inserted as a "WordArt" object) is not extracted.
The same issue appears with excel 2003, power point 2003, word 2007 and excel 2007 documents  (not tested with power point 2007 documents).


Find in attachments the JUnit test class and the document used for testing.
We expected to extract the words "testdoc" and "test phrase".

Notes on the attached documents:

- the document "classic_TextInWordArt.doc" contains the words "testdoc" and "test phrase" as WordArt Text in the document.

"TestUnitPoi35Filter.java" is the JUnit class.
Comment 1 Nick Burch 2008-08-08 03:29:55 UTC
I don't believe POI has any word art support, so as yet this won't be possible.

If you would however like to contribute a patch for some work art support, I'd be only too happy to apply it :)