Bug 45602

Summary: Add Java API for MS Publisher .pub files
Product: POI Reporter: Dmitry Goldenberg <dgoldenberg>
Component: POI OverallAssignee: POI Developers List <dev>
Severity: critical CC: dgoldenberg
Priority: P1    
Version: unspecified   
Target Milestone: ---   
Hardware: All   
OS: All   
Bug Depends on:    
Bug Blocks: 51317    
Attachments: A sample brochure.
A sample newsletter.

Description Dmitry Goldenberg 2008-08-08 12:22:49 UTC
Created attachment 22418 [details]
A sample brochure.

New capability is needed for being able to extract all metadata, textual content, hyperlinks, and any embedded documents from a given MS Publisher (.pub) document.

Priority one would be metadata and content. Priority two would be hyperlinks and embeddings.

Please see the two attached sample .pub files.
Comment 1 Dmitry Goldenberg 2008-08-08 12:23:36 UTC
Created attachment 22419 [details]
A sample newsletter.
Comment 2 Nick Burch 2008-09-07 12:54:17 UTC
HPBF now provides this