Issue 56642

Summary: Importing html crashes soffice
Product: Writer
Reporter: timbaldwin <timbaldwin>
Component: open-import
Assignee: eric.savary
Status: CLOSED FIXED
QA Contact: issues@sw <issues>
Severity: Trivial
Priority: P2
CC: issues, mnagashree
Version: OOo 2.0
Target Milestone: ---
Hardware: PC
OS: Linux, all
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---
Attachments:
Description: HTML file created by doing Save-As in Thunderbird. The file opens successfully in Mozilla HTML editor. Flags: none
Description: A very simple file causing the crash. Flags: none

Description timbaldwin 2005-10-25 18:33:14 UTC
Soffice crashes while importing (File->Open) an HTML file.  I'll attach the
offending HTML file to this bug report.
Comment 1 timbaldwin 2005-10-25 18:35:10 UTC
Created attachment 30825 [details]
HTML file created by doing Save-As in Thunderbird.  The file opens successfully in Mozilla HTML editor.
Comment 2 michael.ruess 2005-10-26 09:01:20 UTC
Reassigned to ES.
Comment 3 eric.savary 2005-10-28 14:46:46 UTC
ES->TRA: amazingly, the crash is due to the <img src> tag. When the source of
an image contains a character combination like
<alphanum_character><hash><question_mark>, this leads to a crash.
Regression compared to OOo 1.1.5 -> target 2.0.2
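For anyone who wants to build a test file by hand, the trigger described above can be sketched in a few lines of Python. This is only an illustration: the regular expression is my reading of "<alphanum_character><hash><question_mark>", the helper names make_test_html and has_trigger are hypothetical, and the generated file is not claimed to match the attachment in this issue.

```python
import re

# Assumed reading of the trigger: one alphanumeric character,
# followed by '#', followed by '?'.
TRIGGER = re.compile(r"[A-Za-z0-9]#\?")


def has_trigger(src: str) -> bool:
    """Return True if the image source contains the suspected pattern."""
    return TRIGGER.search(src) is not None


def make_test_html(src: str) -> str:
    """Return a minimal HTML document with a single <img> using `src`."""
    return f'<html><body><img src="{src}"></body></html>\n'


if __name__ == "__main__":
    # Write a hypothetical test file; the file name is arbitrary.
    with open("img_hash_test.html", "w", encoding="utf-8") as handle:
        handle.write(make_test_html("a#?"))
    for candidate in ("a#?", "picture.png", "img1#?x=1"):
        print(candidate, has_trigger(candidate))
```

Opening the generated file with File->Open in an affected build would be the manual check; the script itself only produces the file and classifies candidate sources.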
Comment 4 eric.savary 2005-10-28 14:52:30 UTC
Created attachment 30961 [details]
A very simple file causing the crash
Comment 5 raphi72 2005-10-31 01:54:56 UTC
The SEGV seems to be in SfxApplication::LoadTemplate ()
from /usr/lib/ooo-2.0/program/libsfx680li.so

Here's a backtrace:

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread 1098461856 (LWP 6244)]
0x4436bf54 in SfxApplication::LoadTemplate ()
   from /usr/lib/ooo-2.0/program/libsfx680li.so
(gdb) bt
#0  0x4436bf54 in SfxApplication::LoadTemplate ()
   from /usr/lib/ooo-2.0/program/libsfx680li.so
#1  0x443ca159 in SfxMedium::GetFileVersion ()
   from /usr/lib/ooo-2.0/program/libsfx680li.so
#2  0x443cac03 in SfxMedium::SfxMedium ()
   from /usr/lib/ooo-2.0/program/libsfx680li.so
#3  0x45c94bdf in SvxLinkManager::SvxLinkManager ()
   from /usr/lib/ooo-2.0/program/libsvx680li.so
#4  0x45c9515a in SvxLinkManager::SvxLinkManager ()
   from /usr/lib/ooo-2.0/program/libsvx680li.so
#5  0x45607e05 in SwSectionFmt::~SwSectionFmt ()
   from /usr/lib/ooo-2.0/program/libsw680li.so
#6  0x457fcaff in SwDocShell::StartLoadFinishedTimer ()
   from /usr/lib/ooo-2.0/program/libsw680li.so
#7  0x456eeac7 in SwIoSystem::IsFileFilter ()
   from /usr/lib/ooo-2.0/program/libsw680li.so
#8  0x406f12ff in SvRefBase::QueryDelete ()
   from /usr/lib/ooo-2.0/program/libtl680li.so
#9  0x40594478 in HTMLParser::_GetNextToken ()
   from /usr/lib/ooo-2.0/program/libsvt680li.so
#10 0x40594d5a in SvParser::NewDataRead ()
   from /usr/lib/ooo-2.0/program/libsvt680li.so
Comment 6 flr 2005-11-02 15:28:34 UTC
flr/tra:
The problem is that the URL contains a "#" character. When it is present, the
creation of an SfxMedium in SvFileObject::LoadFile_Impl() (located in
svx/source/svxlink/fileobj.cxx) crashes.
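
As background on why "#" is special in a link target: generic URL syntax treats everything after the first "#" as the fragment, so a source like "a#?" has the path "a" and the fragment "?". The sketch below shows this with Python's standard urllib.parse. It illustrates generic URL splitting only and makes no claim about how the OOo link code parses URLs; split_link is a hypothetical helper name.

```python
from urllib.parse import urlsplit


def split_link(url: str) -> tuple[str, str, str]:
    """Split a link target into (path, query, fragment) using generic URL rules."""
    parts = urlsplit(url)
    return parts.path, parts.query, parts.fragment


if __name__ == "__main__":
    # The '?' comes after the '#', so it belongs to the fragment, not the query.
    print(split_link("a#?"))
    # For comparison, an ordinary link with both a query and a fragment.
    print(split_link("image.png?x=1#top"))
```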

Comment 7 tino.rachui 2005-11-04 13:01:09 UTC
tra->mav: I was told this is your playground.
Comment 8 mikhail.voytenko 2005-11-24 17:03:44 UTC
Fixed.
Comment 9 eric.savary 2005-12-15 11:25:01 UTC
*** Issue 59377 has been marked as a duplicate of this issue. ***
Comment 10 mikhail.voytenko 2006-01-24 09:55:00 UTC
MAV->ES: Please verify the issue.

re-open issue and reassign to es@openoffice.org
Comment 11 mikhail.voytenko 2006-01-24 09:57:03 UTC
reassign to es@openoffice.org
Comment 12 mikhail.voytenko 2006-01-24 09:57:22 UTC
reset resolution to FIXED
Comment 13 eric.savary 2006-01-27 12:15:23 UTC
VERIFIED in CWS mav19
Comment 14 eric.savary 2006-02-08 12:56:55 UTC
Ok in src680m156