Issue 10976 - Errors in Norwegian locales
Summary: Errors in Norwegian locales
Status: CLOSED FIXED
Alias: None
Product: Internationalization
Classification: Code
Component: code (show other issues)
Version: OOo 1.1 Beta2
Hardware: All All
: P3 Trivial with 1 vote (vote)
Target Milestone: ---
Assignee: oc
QA Contact: issues@l10n
URL:
Keywords:
: 7418 7905 (view as issue list)
Depends on:
Blocks:
 
Reported: 2003-01-27 09:45 UTC by gaute
Modified: 2013-08-07 15:02 UTC (History)
1 user (show)

See Also:
Issue Type: PATCH
Latest Confirmation in: ---
Developer Difficulty: ---


Attachments
Corrected Norwegian Bokmål locale file (16.02 KB, text/plain)
2003-01-27 09:46 UTC, gaute
no flags Details
Corrected Norwegian Nynorsk locale file (16.02 KB, text/plain)
2003-01-27 09:47 UTC, gaute
no flags Details
Corrected Norwegian locale file (The same as nb_NO. I don't know if this one is ever used.) (16.02 KB, text/plain)
2003-01-27 09:48 UTC, gaute
no flags Details
Patch for all three Norwegian locale files, updated to 1.1 beta 2 (46.29 KB, patch)
2003-06-10 09:46 UTC, gaute
no flags Details | Diff
New patch, following suggestions from Eike (23.58 KB, patch)
2003-06-11 14:23 UTC, gaute
no flags Details | Diff
New patch, now with proper Unicode single quotes (23.55 KB, patch)
2003-06-18 10:30 UTC, gaute
no flags Details | Diff

Note You need to log in before you can comment on or make changes to this issue.
Description gaute 2003-01-27 09:45:23 UTC
The Norwegian locale files, found in i18npool/source/localedata_ascii/, are full of errors. There are three Norwegian files: nb_NO for Norwegian Bokmål, nn_NO for Norwegian Nynorsk, and no_NO (I don't know if this last one is used, it should be the same as for Bokmål).

On behalf of the Norwegian localization team, I have corrected the locale files according to Norwegian standards. Among the things corrected are date formats, time formats, quoting characters and abbreviations.

I attach the corrected locale files.
Comment 1 gaute 2003-01-27 09:46:28 UTC
Created attachment 4440 [details]
Corrected Norwegian Bokmål locale file
Comment 2 gaute 2003-01-27 09:47:31 UTC
Created attachment 4441 [details]
Corrected Norwegian Nynorsk locale file
Comment 3 gaute 2003-01-27 09:48:28 UTC
Created attachment 4442 [details]
Corrected Norwegian locale file (The same as nb_NO. I don't know if this one is ever used.)
Comment 4 Dieter.Loeschky 2003-02-25 08:45:08 UTC
DL->Gaute: Thanks for you patch. To accept your patch we need your
signed copyright assignment. Could you please send us the copyright
assignment asap? Please find more info at:
http://www.openoffice.org/contributing.html
Comment 5 eric.savary 2003-04-16 14:38:06 UTC
*** Issue 7418 has been marked as a duplicate of this issue. ***
Comment 6 oc 2003-05-28 11:17:58 UTC
*** Issue 7905 has been marked as a duplicate of this issue. ***
Comment 7 gaute 2003-05-28 12:33:48 UTC
The JCA is now faxed, and I'm mailing it as well later today.

I ask that you do not yet commit the corrected files. I will test them
with 1.1 (beta2) first, and I suspect they need some minor revision 
as well.
Comment 8 gaute 2003-06-10 09:46:44 UTC
Created attachment 6765 [details]
Patch for all three Norwegian locale files, updated to 1.1 beta 2
Comment 9 gaute 2003-06-10 09:58:19 UTC
The patch norwegian_locales.diff will correct the current Norwegian 
locale files in 1.1 beta 2.

Please consider this patch for inclusion in the next 1.1 build.
Comment 10 ooo 2003-06-11 12:40:05 UTC
What's this, this issue is still hanging around as being UNCONFIRMED
since months. I'm grabbing it.

Gaute,
Thanks for filling out the JCA form.
I won't accept this very patch for the following reasons:

1. The patch does not reflect the latest revisions of the data files
as found in i18npool/source/localedata/data/. This is indicated, for
example, by the value of the <Collator> entity, which still has the
locale name included, or the absence (removal) of the
<quarter#Abbreviation> entities, or the reintroduction of meaningless
<DefaultName> entities with value "DIN 5008" and similar. Please take
a look at the latest revisions available.

2. It unnecessarily duplicates a lot of data by replacing ref="en_US"
respectively ref="no_NO" entries with the actual data. This is
especially completely the case for the content of nb_NO.xml, which in
fact is only an alias of no_NO that only differs in the assignment of
 the MS-LANGID. But these ref="..." replacements are also encountered
in other details.

3. The currency format codes are screwed up, since all use [$kr-414] 
now, which instead has to be [$kr-814] for nn_NO and [$kr-14] for
no_NO (this is the MS-LANGID thing mentioned above). Again the patch
replaces the ref="..." and replaceFrom="..." and replaceTo="..."
mechanism with actual data.

4. It seems that also some date format codes were screwed, I didn't
take a detailed look and did no tests in the non-product version
though to verify them, because of the other errors described above.
Please follow the guidelines commented in the latest revision of
i18npool/source/localedata/data/locale.dtd

Please rework and submit a clean patch, thank you.

Btw: are you sure about the <QuotationStart> and <QuotationEnd>
typographic quotation values? They look strange to me, please bear in
mind that all values have to be UTF-8 encoded.
Comment 11 gaute 2003-06-11 14:21:31 UTC
OK, here's a new patch which is more carefully based on the new locale files (1.1b2).

This patch makes heavy use of the ref="" mechanism, with no_NO.xml as the 
"template" file. The only major difference between nb and nn locales is the names 
of the weekdays (the months have the same names). Because of this, I was able
to make nn_NO.xml considerably shorter.


<QuotationStart/End> does look strange, but they did work in 1.0.x. Anyway, the 
proper single quote characters does not exist in ISO-8859-1. I don't like it, but we may 
have to resort to the regular inch character (') instead. (The Windows character set 
contains the necessary quote characters, but on other operating systems you may get 
empty boxes or question marks (which is *really* bad) instead.)

Most Norwegian will use the double quotes anyway, which are «these».


Thanks for your help and patience!
Comment 12 gaute 2003-06-11 14:23:48 UTC
Created attachment 6799 [details]
New patch, following suggestions from Eike
Comment 13 ooo 2003-06-12 12:04:10 UTC
Thanks for the data. I doubt that this will make it into 1.1RC since I
won't start on it before next week due to other tasks I have to
complete. I therefore set the target to 1.1 final but will try to get
it into RC if time permits and release schedule allows.
Comment 14 ooo 2003-06-12 12:22:46 UTC
Forgot to set issue to accepted.
Comment 15 ooo 2003-06-13 12:37:32 UTC
Gaute,

As for the <QuotationStart> issue: internally OOo uses Unicode, and
UTF-8 in XML file format, so specifying the real quotation marks
should be no problem, we do it with all other locales too. For
example, the German typographic double quotes aren't present in
iso-8859-1 either but only in Windows-1252. When exported to an
iso-8859-1 encoding they get replaced by simple ASCII quotes.

Please specify the real quotation marks.
Comment 16 gaute 2003-06-13 19:53:39 UTC
Hmm, I have bad experiences with quotes and Windows-1250. I've seen documents 
with Windows-1250 quotes appearing as question marks on Linux systems. That's 
really, really bad. (The old Norwegian locale files did exactly that.)

If I understand correctly, I can specify the real Unicode characters (not Windows-1250).
Then, the quotes will always degrade gracefully, and never appear as question marks
or boxes, right?
Comment 17 ooo 2003-06-16 11:54:54 UTC
As said before, internally OOo uses UCS2 Unicode, and UTF-8 in the XML
file format. There is no such thing like "Windows-1250 encoding in a
OOo document" if it wasn't imported from an 8-bit file format and a
wrong encoding was chosen. The question mark issue isn't a problem of
encoding but a problem of inadequate fonts. With proper fonts the
right symbols are displayed.

Yes, please specify the real Unicode symbols in UTF-8 encoding.

Upon runtime the user can specify (under menu
Tools.AutoCorrect.CustomQuotes) if and how the ASCII quotes should be
replaced during typing. If for any reason the font you used didn't
contain the proper symbols, you can always disable replacement (thus
use plain ASCII only) or choose another character.
Comment 18 gaute 2003-06-18 10:28:51 UTC
Sorry for the delay, here's a patch with single quotes in Unicode. Both start and end
quotes should be the same, which is a raised comma just like the single end quote 
in en_US.

And of course, if OpenOffice.org can't display a proper fallback character, that's not
the locale's fault. Issue #11018 shows what happens in 1.0.2 with certain fonts on my
Linux computer.
Comment 19 gaute 2003-06-18 10:30:21 UTC
Created attachment 6955 [details]
New patch, now with proper Unicode single quotes
Comment 20 ooo 2003-06-18 18:23:41 UTC
Thanks. Note however that I applied the following changes to comply
with the requirements lined out in
i18npool/source/localedata/data/locale.dtd:

Changed <Time100SecSeparator> from '.' (dot) to ',' (comma) as it must
match the separator used in the format codes (formatindex="44" and
formatindex="45" in this case).

Changed date format code formatindex="20" from >YYYYMMDD< to
>DD.MM.YY< to have a similar meaning as the same format index in other
locales. Was the >YYYYMMDD< really intended?

Committed to branch cws_srx645_ooo11rc:

i18npool/source/localedata/data/nb_NO.xml  1.5.70.1
i18npool/source/localedata/data/nn_NO.xml  1.7.28.1
i18npool/source/localedata/data/no_NO.xml  1.6.28.1
Comment 21 gaute 2003-06-18 19:03:52 UTC
Using a comma as <Time100SecSeparator> is indeed correct. I must have 
overlooked it when revising the patch.

YYYYMMDD was a "filler" format, as nothing else was really appropriate. Since 
similarity with other locales is a concern, I believe DD.MM.YY is a better choice.
However, we chose DD.MM.YY as DateFormatskey1, so it's now being used in 
two places. Switching to D.M.YY as DateFormatskey1 would solve this, and I 
believe it would be more consistent with other locales as well.

Please consider changing DateFormatskey1 to D.M.YY, which I think would be
the final tweak needed.
Comment 22 ooo 2003-06-18 20:11:56 UTC
Duplicated format codes don't really matter, they just may not look
nice in the listbox ;-)

In fact all the formatindex="..." matching has historical roots,
please refer to i18npool/source/localedata/data/locale.dtd and
offapi/com/sun/star/i18n/NumberFormatIndex.idl for further information.
Comment 23 ooo 2003-07-02 13:28:09 UTC
Present in ooo11rc CWS build, reassign to QA.
Comment 24 oc 2003-07-02 14:12:33 UTC
Reset resolution to fixed
Comment 25 oc 2003-07-02 14:13:07 UTC
Verified in internal build OOo11RC
Comment 26 oc 2003-07-16 10:02:55 UTC
Closed because fixed in OOo1.1.RC