Issue 128261 - An .html crashes OpenOffice 4.1.7 when I use Tools - Spelling and Grammar.
Summary: An .html crashes OpenOffice 4.1.7 when I use Tools - Spelling and Grammar.
Status: CONFIRMED
Alias: None
Product: Writer
Classification: Application
Component: editing (show other issues)
Version: 4.1.7
Hardware: All All
: P5 (lowest) Normal (vote)
Target Milestone: ---
Assignee: AOO issues mailing list
QA Contact:
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2020-01-09 05:31 UTC by StevenJonKaplan
Modified: 2023-08-08 10:16 UTC (History)
4 users (show)

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
The attachment is a typical .html file which I create for subscribers. (18.05 KB, text/html)
2020-01-09 05:31 UTC, StevenJonKaplan
no flags Details

Note You need to log in before you can comment on or make changes to this issue.
Description StevenJonKaplan 2020-01-09 05:31:58 UTC
Created attachment 86808 [details]
The attachment is a typical .html file which I create for subscribers.

If you import the following .html file into OpenOffice 4.1.6 or 4.1.7 then as soon as you run Spelling and Grammar it will crash on the line which says "the-sp".  I tried this on multiple computers and the problem occurs on each one.

My name is Steven Jon Kaplan.  My home telephone number is (201) 246-0003.
Comment 1 Dave Barton 2020-01-09 08:57:31 UTC
Tested the html attachment with 4.1.7 on Win10 x64 and Mint 19.2 Cinnamon several times and was unable to reproduce a crash or freeze of any kind. Neither was I able to find a line which says "the-sp" in the html file.
Also opened and ran spellcheck on several other randomly selected html files and was still unable to reproduce your issue.
Comment 2 StevenJonKaplan 2020-01-09 13:57:38 UTC
In order to make this problem easier to resolve I just spent time reducing the problem to a tiny .html section which consistently crashes during a spell check.  Since it is so small I will show it below so that researchers can more easily reproduce the issue:

<p><li class="nav"><a href="https://www.marketwatch.com/story/pimco-says-timing-for-next-recession-delayed-but-central-banks-will-have-less-firepower-to-fight-it-2020-01-08?mod=home-page">
Pimco Says Timing for Next Recession Delayed but Central Banks Will Have Less Firepower to Fight It</a></li></p>


Please have someone look into this.  It crashes running Windows 10 on both a modern Y7000 by Lenovo as well as my ancient Toshiba Satellite with a similar operating system.  All you have to do is to enter the above lines under "Text Document", then run "Tools", "Spelling and Grammar".

If you wish you can telephone me at (201) 246-0003.

Thank you.  --Steven Jon Kaplan
Comment 3 Dave Barton 2020-01-09 16:20:23 UTC
Interestingly, after pasting your html paragraph code snippet into a Writer TEXT document as plain text, it does generate the spellcheck crash, but opening a real html document containing only your code snippet does not.
I was unable to replicate the crash in LibreOffice, which I (maybe incorrectly) believe uses the same spellcheck code.
Comment 4 Keith N. McKenna 2020-01-09 16:44:54 UTC
I have tested this report with both your original file and your compact instructions from Comment 2 with Windows 10 X64 and AOO 4.1.7 and cannot confirm. I am closing this issue. If you have further information that could help us confirm this feel free to re-open this issue.
Comment 5 StevenJonKaplan 2020-01-09 17:03:04 UTC
I am going to attempt to compact the text to the exact tiny piece which causes the crash.  I am sorry that you could not reproduce it but since I have this problem on two machines and Dave Barton was also able to recreate the crash, it must be a real issue.  I would recommend keeping it open.
Comment 6 Keith N. McKenna 2020-01-09 17:10:16 UTC
Re-opening per Comment 5
Comment 7 StevenJonKaplan 2020-01-09 18:00:43 UTC
After additional research I have found the precise character which causes the crash.  Below is a sufficient snippet to crash every time when you click on "Spelling and Grammar"--be sure you say "yes" to continue through the document if it asks if you want to start again at the beginning.  This should be entered as a single line with no spaces or separations; it shows three lines on this forum but it is one continuous line:

<p><li class="nav"><a href="https://www.marketwatch.com/story/pimco-says-timing-for-next-recession-delayed-but-central-banks-will-have-less-firepower-to-fight-it"></a></li></p>

In order for it not to crash, all I have to do is remove a single character:  the t prior to the second double-quote, as follows:

<p><li class="nav"><a href="https://www.marketwatch.com/story/pimco-says-timing-for-next-recession-delayed-but-central-banks-will-have-less-firepower-to-fight-i"></a></li></p>

The above snippet will not crash "Spelling and Grammar".

Notice that the URL I used is a real one, not something I invented.

So there must be an issue related to the specific combination of characters.
Comment 8 Keith N. McKenna 2020-01-09 18:31:10 UTC
With your latest snippet I can confirm
Comment 9 StevenJonKaplan 2020-01-09 18:50:48 UTC
Keith--  I am glad to hear it.  Since a single character seems to make a difference, perhaps that will help to isolate the cause of the issue.  --Steve
Comment 10 Regina Henschel 2020-01-09 20:22:14 UTC
The list-element syntax it wrong. A list-element is not allowed as child element of a <p>element. You can use https://validator.w3.org/#validate_by_upload to validate your files.

I cannot confirm the crash, neither if I import the file into a new text document, nor opening the html-file directly. I use AOO416m1(Build:9790)  -  Rev. 1844436
2018-10-23 12:57
Comment 11 StevenJonKaplan 2020-01-09 20:46:39 UTC
Regina--  That is a useful way of checking .html files and I appreciate your providing the link.

Even if I fix the syntax, if I restore the original link it will always crash the spell checker.

Try doing cut and paste to move the snippet into your text document and then run the spell checker.  That should crash immediately.  --Steve