Bug 45595 - poi-3.5-beta1-20080718.jar - the WordArt text in a 2003 doc document is not extracted.
Summary: poi-3.5-beta1-20080718.jar - the WordArt text in a 2003 doc document is not e...
Status: RESOLVED WONTFIX
Alias: None
Product: POI
Classification: Unclassified
Component: POI Overall (show other bugs)
Version: unspecified
Hardware: PC Windows Server 2003
: P2 normal (vote)
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2008-08-08 03:22 UTC by xtrim
Modified: 2008-08-08 03:29 UTC (History)
0 users



Attachments
Contains JUnit test class and documents used for testing. (35.73 KB, application/x-zip-compressed)
2008-08-08 03:22 UTC, xtrim
Details

Note You need to log in before you can comment on or make changes to this bug.
Description xtrim 2008-08-08 03:22:38 UTC
Created attachment 22411 [details]
Contains JUnit test class and documents used for testing.

The text formated with "WordArt" (= inserted as a "WordArt" object) is not extracted.
The same issue appears with excel 2003, power point 2003, word 2007 and excel 2007 documents  (not tested with power point 2007 documents).


Find in attachments the JUnit test class and the document used for testing.
We expected to extract the words "testdoc" and "test phrase".

Notes on the attached documents:

- the document "classic_TextInWordArt.doc" contains the words "testdoc" and "test phrase" as WordArt Text in the document.

"TestUnitPoi35Filter.java" is the JUnit class.
Comment 1 Nick Burch 2008-08-08 03:29:55 UTC
I don't believe POI has any word art support, so as yet this won't be possible.

If you would however like to contribute a patch for some work art support, I'd be only too happy to apply it :)