Issue 83494 - Implement XHTML import filter with MathML & XForms support
Summary: Implement XHTML import filter with MathML & XForms support
Status: UNCONFIRMED
Alias: None
Product: xml
Classification: Code
Component: code (show other issues)
Version: OOo 2.3
Hardware: All All
: P2 Trivial with 1 vote (vote)
Target Milestone: ---
Assignee: AOO issues mailing list
QA Contact:
URL:
Keywords:
: 83249 (view as issue list)
Depends on:
Blocks:
 
Reported: 2007-11-09 14:55 UTC by vsedov
Modified: 2013-02-07 22:32 UTC (History)
1 user (show)

See Also:
Issue Type: ENHANCEMENT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
XSLT 2.0 first iteration of XHTML import sample (28.32 KB, text/xml)
2008-09-12 16:20 UTC, xquery
no flags Details
new (28.68 KB, text/xml)
2009-02-01 20:00 UTC, xquery
no flags Details

Note You need to log in before you can comment on or make changes to this issue.
Description vsedov 2007-11-09 14:55:02 UTC
we can use new process of importing both HTML&XHTML files

1. if file is valid XHTML 1.0 Strict - goto 3
2. sanitize source with tidy (http://www.w3.org/People/Raggett/tidy/)
3. perform import of XHTML 1.0 Strict

i guess it will have more sense in terms of "feature-proof" - it allow to drop
support for old "street HTML" filter and free some C++ developer resources, move
transformation logic to XSLT and have one importing logic for both XHTML&HTML
sources. Also it will be good for tidy project - extended testing base. Some
open office issues will be directed to tidy engineers. Some bugs will be fixed
even without issue report to OpenOffice.org (just after upgrading tidy). Also
tidy may be used for prepare not XHTML web-pages for extracting data using XPath
and XQuery.

As XSLT developer i can help in greating XHTML import filter.
Comment 1 michael.ruess 2007-11-09 15:16:56 UTC
Reassigned to JSI.
Comment 2 jogi 2008-01-15 06:47:46 UTC
@sus: FRom my point of view issue 83249 is duplicate to this one.
Comment 3 jogi 2008-01-15 06:49:07 UTC
changing components because XHTML is based on 'xmloff' and there is no XHTML
implementation directly in 'word processor' code.
Comment 4 jogi 2008-01-15 06:51:39 UTC
*** Issue 83249 has been marked as a duplicate of this issue. ***
Comment 5 svante.schubert 2008-05-25 14:14:11 UTC
I guess this is more an enhancement to be figured out by our requirement guys.
Changed owner...
Comment 6 xquery 2008-09-12 16:20:21 UTC
Created attachment 56480 [details]
XSLT 2.0 first iteration of XHTML import sample
Comment 7 xquery 2008-09-12 16:29:10 UTC
i post first iteration of XHTML import filter (XSLT 2.0) - it already support 
MathML, encoded images (same as XHTML export), support of XForms coming soon


at least it better then plain text so i suggest to include this as XSLT 2.0 
import filter sample to OO 3.0 so peoples can be involved for this filter 
testing and evolution
Comment 8 gleppert 2009-01-31 12:08:04 UTC
@xquery: it's great to see the development of an xhtml import filter. Has
anything happened/changed since September 08? Is it planned to include it in the
default install set of OpenOffice.org?

How can I install and test your filter?

Thank you
Gerald 
Comment 9 xquery 2009-01-31 21:14:32 UTC
it was just possibility testing, for now this task stalled at requirments

from last Tuesday i am leaved Wiley so i can spend some extra time at this task 

Gerald, as "first customer" and voter on this task you can help by providing 
some xhtml as sample and testcase
Comment 10 xquery 2009-02-01 20:00:08 UTC
Created attachment 59809 [details]
new
Comment 11 gleppert 2009-06-13 09:14:12 UTC
Hello,

are there any news on the XHTML import filter? As I was trying the last updated
filter by xquery (attachment from Feb1), I was impressed, because it is much
better than the html import, which is built in in OpenOffice.org. When will it
be included in the default install set?

Furthermore, are there any news on the import of external CSS during XHTML import?

Thank you
Gerald
Comment 12 xquery 2009-06-13 12:28:34 UTC
i can implement it if someone will sponsor my development (about 900$ per month i 
need just to survive)

i guess it can consume about 3-4 months (with CSS & XForms support) so it will be 
2700$ - 3600$ total