SA Bugzilla – Bug 4794
is_charset_ok_for_locales() may be too generic
Last modified: 2007-02-17 02:45:17 UTC
I've configured: ok_locales en fr (or even just "en") and I notice that messages written in Turkish, Cyrillic, Greek, etc. all get through just fine even though my locales are English, or English and French. Apparently the sieve for language tests is too coarse.

I'm thinking that in "en", the rule that should apply is the following:

* the US-ASCII charset is fine;
* all 7-bit characters are fine;
* the 8-bit characters in ISO-8859-1 should be fine (if we want to be extra liberal);
* the non-accented characters in ISO-8859-[2-4] should be fine (section sign, non-breaking space, etc.).

Anything else should either fail the test outright, or pass only when accented characters from these "borderline" character sets make up a small percentage of the message (say, less than 0.5%) and fail beyond that, since someone might send a message in English but write their name or signature in Greek or Russian or whatever.
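
For illustration only, here is a rough sketch of the rule proposed above, written in Python rather than as a patch to SpamAssassin. The function name charset_ok_for_en, its parameters, and the choice to count only non-ASCII letters as "accented characters" are all assumptions made for this sketch; it is not how is_charset_ok_for_locales() works today.

```python
import re
import unicodedata

# Hypothetical helper, not SpamAssassin code: a sketch of the proposed "en" rule.
_ISO8859 = re.compile(r"^iso[-_]?8859[-_]?(\d+)$")


def charset_ok_for_en(charset, body, max_ratio=0.005):
    """Return True if a message body (bytes) in `charset` passes the "en" rule."""
    name = charset.strip().lower()

    # The US-ASCII charset is fine.
    if name in ("us-ascii", "usascii", "ascii"):
        return True

    # All 7-bit characters are fine, whatever charset was declared.
    if all(b < 0x80 for b in body):
        return True

    match = _ISO8859.match(name)
    if not match:
        return False
    part = int(match.group(1))

    # 8-bit characters in ISO-8859-1 are fine (the "extra liberal" option).
    if part == 1:
        return True

    # ISO-8859-[2-4]: symbols such as the section sign or non-breaking space
    # always pass; non-ASCII letters must stay under the threshold.
    if part in (2, 3, 4):
        text = body.decode("iso8859_%d" % part, errors="replace")
        letters = sum(
            1 for ch in text
            if ord(ch) > 0x7F and unicodedata.category(ch).startswith("L")
        )
        return letters / len(text) < max_ratio

    # Everything else (Greek, Cyrillic, Turkish, ...) fails.
    return False
```

With this sketch, a long English message that ends in a name containing one Latin-2 letter would pass, while a body written mostly in accented Latin-2 letters, or anything declared as KOI8-R with 8-bit content, would fail. The 0.5% default simply mirrors the figure suggested above and would need tuning against real mail.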
This RFE would be taken care of by fixing bug 4078, so closing as a dupe.

*** This bug has been marked as a duplicate of 4078 ***