Created attachment 22385 [details] Contains JUnit test class and documents used for testing. The text contained in a Chart object inserted/created in a word 2007 document is not extracted. Find in attachments the JUnit test class and the document used for testing. We expected to extract the words "testdoc" and "test phrase". Notes on the attached documents: - the document "classic_TextInChart.docx" contains the words "testdoc" and "test phrase" in the Chart inserted in the document. "TestUnitPoi35Filter.java" is the JUnit class.
I'm not sure if we want to be going that far down into graphics objects by default. If you'd like to submit a patch to extract the text, along with a flag to toggle the behaviour on/off, I'll happily apply it to svn :)