Issue 99131 - Word count incorrect for two words joined by an em dash
Summary: Word count incorrect for two words joined by an em dash
Status: CLOSED DUPLICATE of issue 89042
Alias: None
Product: Writer
Classification: Application
Component: code (show other issues)
Version: OOo 3.0.1
Hardware: All All
: P3 Trivial (vote)
Target Milestone: ---
Assignee: thomas.lange
QA Contact: issues@sw
: 107093 (view as issue list)
Depends on:
Reported: 2009-02-11 19:05 UTC by logiconcepts819
Modified: 2013-08-07 14:44 UTC (History)
3 users (show)

See Also:
Issue Type: DEFECT
Latest Confirmation in: ---
Developer Difficulty: ---


Note You need to log in before you can comment on or make changes to this issue.
Description logiconcepts819 2009-02-11 19:05:51 UTC
In 3.0.1, there is a small yet annoying problem that occurs when
two words are joined by an em dash.  For instance, type the following sentence
into a blank document:

I am experienced with computers—and there is no doubt about it.

According to OpenOffice, the word count is 11.  However, the correct word count
is 12.  The trouble spot is where the words "computers" and "and" are joined by
the em dash.  As explained before, if you highlight this region, OpenOffice will
think the word count there is 1, but the actual word count is 2.

To reproduce the problem:
1. Open Writer.
2. Enter the text into the blank document as above.
3. Click the "Tools" menu.
4. Click the "Word Count" menu item to verify the incorrect word count.
Comment 1 michael.ruess 2009-02-19 13:08:34 UTC
MRU->AMA: could you please redirect to appropriate deeloper as long as FME is
not responsible for "Word count" feature anymore? Thanks!

Can confirm this in OOO310m1 build. Just paste the text
I am experienced with computers—and there is no doubt about it.
into Writer and call "Word count". Writer counts 11 words, while a competing
product correctly counts 12 words.
Comment 2 Rainer Bielefeld 2009-03-28 17:53:30 UTC
I checked with "Ooo 3.0.1 (DE) Multilingual version GERMAN UI WIN XP: [OOO300m15
(Build 9379)]" and can confirm the reported effect.
Comment 3 stefan.baltzer 2009-11-27 15:46:46 UTC
Reassigned to TL.
Comment 4 stefan.baltzer 2009-11-27 15:49:17 UTC
*** Issue 107093 has been marked as a duplicate of this issue. ***
Comment 5 stefan.baltzer 2009-11-27 15:51:54 UTC
Note: Counting non-characters as words must be solved "all at once". 
Keeping an issue for each miscounted symbol makes not much sense.

See also issue 89042 "word count is incorrect with certain special characters
(custom quotes) in the text."

*** This issue has been marked as a duplicate of 89042 ***
Comment 6 stefan.baltzer 2009-11-27 15:54:51 UTC
I put the "em-dash" example in the summary of issue 89042.

Closing this one.