Bug 45562 - poi-3.5-beta1-20080718.jar - content from a Chart object of a 2007 docx document is not extracted.
Summary: poi-3.5-beta1-20080718.jar - content from a Chart object of a 2007 docx docum...
Status: RESOLVED WONTFIX
Alias: None
Product: POI
Classification: Unclassified
Component: POI Overall (show other bugs)
Version: unspecified
Hardware: PC Windows Server 2003
: P2 normal (vote)
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2008-08-05 07:40 UTC by xtrim
Modified: 2008-08-05 15:52 UTC (History)
0 users



Attachments
Contains JUnit test class and documents used for testing. (71.86 KB, application/x-zip-compressed)
2008-08-05 07:40 UTC, xtrim
Details

Note You need to log in before you can comment on or make changes to this bug.
Description xtrim 2008-08-05 07:40:06 UTC
Created attachment 22385 [details]
Contains JUnit test class and documents used for testing.

The text contained in a Chart object inserted/created in a word 2007 document is not extracted.
Find in attachments the JUnit test class and the document used for testing.
We expected to extract the words "testdoc" and "test phrase".

Notes on the attached documents:

- the document "classic_TextInChart.docx" contains the words  "testdoc" and "test phrase" in the Chart inserted in the document.


"TestUnitPoi35Filter.java" is the JUnit class.
Comment 1 Nick Burch 2008-08-05 15:52:23 UTC
I'm not sure if we want to be going that far down into graphics objects by default.

If you'd like to submit a patch to extract the text, along with a flag to toggle the behaviour on/off, I'll happily apply it to svn :)