Apache OpenOffice (AOO) Bugzilla – Full Text Issue Listing
|Summary:||Implement XHTML import filter with MathML & XForms support|
|Component:||code||Assignee:||AOO issues mailing list <issues>|
|Status:||UNCONFIRMED ---||QA Contact:|
|Issue Type:||ENHANCEMENT||Latest Confirmation in:||---|
Description vsedov 2007-11-09 14:55:02 UTC
We could use a new process for importing both HTML and XHTML files:
1. If the file is valid XHTML 1.0 Strict, go to step 3.
2. Sanitize the source with tidy (http://www.w3.org/People/Raggett/tidy/).
3. Perform the import of XHTML 1.0 Strict.
I think this makes more sense in terms of being "future-proof": it would allow dropping support for the old "street HTML" filter, free up some C++ developer resources, move the transformation logic to XSLT, and leave one import path for both XHTML and HTML sources. It would also be good for the tidy project, which would gain an extended testing base: some OpenOffice.org issues would be directed to the tidy engineers, and some bugs would be fixed without any issue report to OpenOffice.org, just by upgrading tidy. tidy could also be used to prepare non-XHTML web pages for data extraction with XPath and XQuery. As an XSLT developer, I can help in creating the XHTML import filter.
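The three-step process above could be sketched roughly as follows. This is only an illustration, not part of any existing filter: the function names are hypothetical, the step 1 check is a cheap well-formedness test rather than real validation against the XHTML 1.0 Strict DTD, and step 2 assumes the `tidy` command-line tool is installed and on the PATH. Step 3 (the XSLT import itself) is left out.

```python
import subprocess
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"


def looks_like_xhtml(source: str) -> bool:
    """Step 1 (approximation): well-formed XML with an XHTML <html> root.

    This does NOT validate against the XHTML 1.0 Strict DTD; documents
    using named entities such as &nbsp; fail here and fall through to tidy.
    """
    try:
        root = ET.fromstring(source)
    except ET.ParseError:
        return False
    return root.tag == "{%s}html" % XHTML_NS


def tidy_to_xhtml(source: str) -> str:
    """Step 2: let the tidy CLI turn "street HTML" into XHTML."""
    result = subprocess.run(
        ["tidy", "-q", "-asxhtml", "-utf8"],
        input=source.encode("utf-8"),
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
    )
    # tidy exits with 0 (clean), 1 (warnings) or 2 (errors).
    if result.returncode >= 2:
        raise ValueError(result.stderr.decode("utf-8", "replace"))
    return result.stdout.decode("utf-8")


def prepare_for_import(source: str, sanitize=tidy_to_xhtml) -> str:
    """Return markup ready for step 3, the XHTML import filter."""
    if looks_like_xhtml(source):
        return source  # already XHTML: skip straight to step 3
    return sanitize(source)
```

The `sanitize` parameter is there only so that the decision logic can be exercised without tidy being installed.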
Comment 1 michael.ruess 2007-11-09 15:16:56 UTC
Reassigned to JSI.
Comment 2 jogi 2008-01-15 06:47:46 UTC
@sus: From my point of view, issue 83249 is a duplicate of this one.
Comment 3 jogi 2008-01-15 06:49:07 UTC
Changing the component because XHTML is based on 'xmloff' and there is no XHTML implementation directly in the 'word processor' code.
Comment 4 jogi 2008-01-15 06:51:39 UTC
*** Issue 83249 has been marked as a duplicate of this issue. ***
Comment 5 svante.schubert 2008-05-25 14:14:11 UTC
I guess this is more an enhancement to be figured out by our requirement guys. Changed owner...
Comment 6 xquery 2008-09-12 16:20:21 UTC
Created attachment 56480 [details] XSLT 2.0 first iteration of XHTML import sample
Comment 7 xquery 2008-09-12 16:29:10 UTC
I have posted the first iteration of the XHTML import filter (XSLT 2.0). It already supports MathML and encoded images (same as the XHTML export); XForms support is coming soon. At the least, it is better than plain text, so I suggest including it as an XSLT 2.0 import filter sample in OO 3.0 so that people can get involved in testing and evolving this filter.
Comment 8 gleppert 2009-01-31 12:08:04 UTC
@xquery: it's great to see the development of an xhtml import filter. Has anything happened/changed since September 08? Is it planned to include it in the default install set of OpenOffice.org? How can I install and test your filter? Thank you Gerald
Comment 9 xquery 2009-01-31 21:14:32 UTC
It was just a feasibility test; for now this task is stalled at the requirements stage. As of last Tuesday I have left Wiley, so I can spend some extra time on this task. Gerald, as the "first customer" and a voter on this task, you can help by providing some XHTML files as samples and test cases.
Comment 11 gleppert 2009-06-13 09:14:12 UTC
Hello, is there any news on the XHTML import filter? When I tried the latest updated filter by xquery (attachment from Feb 1), I was impressed, because it is much better than the HTML import built into OpenOffice.org. When will it be included in the default install set? Furthermore, is there any news on importing external CSS during XHTML import? Thank you, Gerald
Comment 12 xquery 2009-06-13 12:28:34 UTC
I can implement it if someone sponsors my development (I need about $900 per month just to survive). I guess it would take about 3-4 months (with CSS and XForms support), so it would be $2700-$3600 in total.