Apache OpenOffice (AOO) Bugzilla – Issue 80557
sl_SI Slovenian: alphanumerical sorting CE characters Đ
Last modified: 2013-08-07 15:01:53 UTC
Alphanumeric sorting in all aplications using Central European characters has an isue with letter Đ (if you go to add special symbol this letter is char U+0110 and U+0111) When I try to sort it thinks letters D and Đ are used as same letter Example I have column: DA DG ĐA ĐB ĐR DS after alphanumerical sorting it makes: DA ĐA ĐB DG ĐR DS correct would be: DA DG DS ĐA ĐB ĐR
Thank you, I'll take a look at it. I guess it is defined in the sl.xml file but I have to ask on l10n-dev maillist.
The collation order (sorting) is indeed specified in the locale. My language also uses the crossed D (đ, Đ), which comes after the ordinary D in our collation order. We have an open collation bug [78054] posted against Calc, but it's about collating words with some combinations of diacritics. OpenOffice.org follows the collation order in our locale for the crossed D. It should do the same for other languages.
Eike, I reassigned this to you. Like you told me last time, I am sending only diff (the last line is the right/corrected one) of sl_SI.xml: $diff sl_SI.orig.xml sl_SI.xml 191c191 < <IndexKey phonetic="false" default="true" unoid="alphanumeric">A-C Č D-S Š T-Z Ž</IndexKey> --- > <IndexKey phonetic="false" default="true" unoid="alphanumeric">A-C Č Ć D Đ-S Š T-Z Ž</IndexKey>
... and changing the Component
ah, component should be l10n, sorry
and subcomponent localedata
Is this issue really about Slovenian (sl_SI)? Because for Slovakian (sk_SK) it works as requested. For collation the locale data files are not involved. Tailored collations are done in i18npool/source/collator/data/ files. The locale data file's IndexKey sequence is for the Alphabetical Index in Writer. Anyway, when setting the language to Slovakian in the sort dialog options, or when left as Default and setting the OOo locale to Slovakian under Tools.Options.LanguageSettings.Languages "Language of Locale Setting", the sort result is DA DG DS ĐA ĐB ĐR so OOo does sort that correctly for Slovakian. The same should be true if the system locale is set to sk_SK and OOo locale left at Default. However, the IndexKey could be adapted. Are you sure this is about Slovenian? See also the COMMON column in http://www.unicode.org/cldr/data/charts/collation/sk_SK.html http://www.unicode.org/cldr/data/charts/collation/sl_SI.html Slovenian does not have a special treatment for đ and Đ.
Could someone please clarify? Is this issue about Slovenian or Slovakian? See previous comment. Thanks Eike
Eike, this issue is about Slovenian. We don't have letters "đ" and "Đ" in the "COMMON column in http://www.unicode.org/cldr/data/charts/collation/sl_SI.html" because, I guess, these two letters are not in our (Slovenian) alphabet. But we still use them as we were part of Yugoslavia once, where they still use them. If I set the language to Slovenian in the sort dialog options, the result is not OK (of course). What is best to do? To correct IndexKey in sl_SI.xml or add these 2 letters in http://www.unicode.org/cldr/data/charts/collation/sl_SI.html? Thanks
Bobe, As stated in #desc8 please note that collation and the OOo IndexKey are different things. If you want future versions of ICU (and thus other applications as well) to support the Slovenian collation including Đ you'll also have to contact the CLDR. For OOo we may create a tailored collation in i18npool/source/collator/data/ Eike
Karl, could you please take over and add collation and IndexKey in an i18n cws? Thanks Eike
Is there any progress in solving this issue? It is quite annoying to sort manually these letters. Thank you, Marko
Add tailoring data and index key characters as suggested.
I am sorry to bother again but I searched everywhere and could not find any locales and sl_SI.xml. I instaled new version 2.3.1 over 2.3 (now I have both ?!?funny) and still no locales. Can you give me directions where to find locales? I am using Vista OS. thanks!
@markomb: The locale data annd collation files we're talking here about are in the OOo source code build tree, not in the installation. They get compiled into binary libraries that are used during runtime.
So I cannot use it until new version?
ready for QA.
khong, where can I find the newest version of sl_SI.xml? I need to post it to Pavel, too, so he can include in in the new builds. Thanks.
@bobe: No need to provide Pavel with single files. He builds the master milestones, so once CWS i18n39, where the fix for this issue was added to, will be integrated everything will be fine. For monitoring the CWS see http://eis.services.openoffice.org/EIS2/cws.ShowCWS?Path=SRC680%2Fi18n39
Verified in CWS i18n39.
OK in OOO320_m7. Closed.