Bug 45557 - The name of a document (docx, xlsx, or pptx) linked to a 2007 docx document is not extracted
Status: NEW
Alias: None
Product: POI
Classification: Unclassified
Component: POI Overall
Version: unspecified
Hardware: PC All
Importance: P5 enhancement
Target Milestone: ---
Assignee: POI Developers List
URL:
Keywords:
Duplicates: 45564
Depends on:
Blocks:
 
Reported: 2008-08-05 05:48 UTC by xtrim
Modified: 2017-01-05 13:02 UTC
CC List: 0 users



Attachments
Contains JUnit test class and documents used for testing. (274.17 KB, application/x-zip-compressed)
2008-08-05 05:48 UTC, xtrim
junit test case (256.85 KB, application/gzip)
2016-06-20 09:18 UTC, Javen O'Neal

Description xtrim 2008-08-05 05:48:10 UTC
Created attachment 22380
Contains JUnit test class and documents used for testing.

The filename of a document linked from a Word 2007 (.docx) document is not extracted.
The attachment contains the JUnit test class and the documents used for testing.
The expected result is that the word "testdoc", which is part of the linked document's filename, is extracted.

Notes on the attached documents:


- the documents "classic_NameLinkedObject_word.docx" and "form_NameLinkedObject_word.docx" each link to a docx document whose filename contains the word "testdoc".

- the documents "classic_NameLinkedObject_excel.docx" and "form_NameLinkedObject_excel.docx" each link to an xlsx document whose filename contains the word "testdoc".

- the documents "classic_NameLinkedObject_ppt.docx" and "form_NameLinkedObject_ppt.docx" each link to a pptx document whose filename contains the word "testdoc".


"TestUnitPoi35Filter.java" is the JUnit class.
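
For reference, a minimal sketch of where the linked filename would be found. It assumes that the link is stored as a relationship with TargetMode="External" in the package part "word/_rels/document.xml.rels", which is how OOXML packages usually record linked objects. The class and method names are hypothetical, and the sketch uses only the JDK, not the POI API. It does not filter by relationship type, so external hyperlinks would be listed as well.

```java
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper, not part of POI.
class LinkedTargets {
    // Assumed location of the main document's relationship part in a .docx package.
    static final String RELS_PART = "word/_rels/document.xml.rels";

    /** Returns the Target attribute of every relationship marked TargetMode="External". */
    static List<String> externalTargets(InputStream relsXml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        NodeList rels = factory.newDocumentBuilder()
                .parse(relsXml)
                .getElementsByTagNameNS("*", "Relationship");
        List<String> targets = new ArrayList<>();
        for (int i = 0; i < rels.getLength(); i++) {
            Element rel = (Element) rels.item(i);
            // Internal parts (styles, images, ...) have no TargetMode or "Internal".
            if ("External".equals(rel.getAttribute("TargetMode"))) {
                targets.add(rel.getAttribute("Target"));
            }
        }
        return targets;
    }

    /** Opens a .docx as a zip archive and lists its external relationship targets. */
    static List<String> externalTargetsOfDocx(String docxPath) throws Exception {
        try (ZipFile zip = new ZipFile(docxPath)) {
            ZipEntry entry = zip.getEntry(RELS_PART);
            if (entry == null) {
                return new ArrayList<>(); // no relationship part, so nothing is linked
            }
            try (InputStream in = zip.getInputStream(entry)) {
                return externalTargets(in);
            }
        }
    }
}
```

Calling LinkedTargets.externalTargetsOfDocx("classic_NameLinkedObject_word.docx") would, under the assumption above, be expected to return a target whose path contains "testdoc".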
Comment 1 Javen O'Neal 2016-06-20 08:56:41 UTC
*** Bug 45564 has been marked as a duplicate of this bug. ***
Comment 2 Javen O'Neal 2016-06-20 09:18:05 UTC
Created attachment 33967
junit test case

As of POI 3.15 beta 2, POI text extraction via POITextExtractor.getText() does not return the names of linked documents.

A similarly named test, "ExtractorFactory.getEmbeddedDocsTextExtractors", is currently not implemented.
Comment 3 Javen O'Neal 2017-01-05 13:02:39 UTC
The original bug report was generated using poi-3.5-beta1-20080718.jar.