Apache OpenOffice (AOO) Bugzilla – Issue 76620
Duplicate matches in indexes will only show the sub-string keyword
Last modified: 2017-05-20 11:15:12 UTC
Hi, To reproduce: - use the .sdi file joined to the issue to make your index - all the entries are displayed but not the ones where the second word contains an apostrophe (i.e modèle d'impression) Whatever the way you filled the .sdi file (via ui, or notepad), the entries are not displayed. Kind regards Sophie
Created attachment 44617 [details] .sdi file for example
Reassigned to ES
Interesting... This issue shows 3 different things: 1) The real it problem is not the apostroph but the fact that a keyword is found 2 times in the same string. The bug can be described as following: if a keyword "A" is part of another keyword "A B" (or "B A", the order doesn't matter) AND "A" is placed before "A B" in the *.sdi file, the index will only display "A". Ex: a) the sdi looks like: bug;bug;;;0;0 what a nice bug;what a nice bug;;0;0 -> The index will only display Bug.................1 b) the sdi looks like: what a nice bug;what a nice bug;;0;0 bug;bug;;;0;0 -> The index will only display Bug................................1 What a nice bug..............1 ---------- Other remarks which are not parts of the problem: 2) Sophie, your file shows "Mod�le d'impression;Mod�le d'impression;Impression;;0;1" Yes, with a broken "è". Maybe an import/export/IssueZilla problem...? 3) You may file an ENHANCEMENT (if not yet done) for "apostroph recognition in indexes". Indeed, just reading this issue, I thought the problem was that you had used normal simple quotes (apostrophs) in the sdi but your text would contain only formatted quotes. In this case, because unformatted and formatted quotes are 2 different characters, the formatted quotes won't be recognized. The enhacement would be to treat unformatted and formatted quotes as the same character.
Hi Eric, I agree with 1) for 2), it's not my file, but a user one but I can confirm it's not an IZ issue as it shows the same wrong è on my system Concerning 3) I don't understand what is formatted quotes, could you explain me what it stand for ? What is really annoying me is that part of your data are not shown, it makes the .sdi files not usable or trustable in fact. and 4) whaou, your summary is really better than mine, thanks :-) Kind regards - Sophie
move target to 3.x according http://wiki.services.openoffice.org/wiki/Target_3x
Reset assigne to the default "issues@openoffice.apache.org".