Bug 7691 - MIXED_ES fires for emails in Greek, possibly other languages too
Status: RESOLVED INVALID
Alias: None
Product: Spamassassin
Classification: Unclassified
Component: Rules
Version: 3.4.2
Hardware: All Linux
Importance: P2 normal
Target Milestone: Undefined
Assignee: SpamAssassin Developer Mailing List
 
Reported: 2019-02-21 16:34 UTC by Savvas Karagiannidis
Modified: 2019-03-20 22:21 UTC

Description Savvas Karagiannidis 2019-02-21 16:34:46 UTC
Across many different installations, a large number of messages end up being marked as spam because of the MIXED_ES test. I live in Greece, and in Greek, characters such as "ε" and "έ" are very common, so it is natural for emails in Greek to contain many instances of them.
It would be fine if this test had a low score, but the current score is almost 4 (3.999 3.284 3.999 3.284), which is far too high to ignore!
This test really only makes sense for English-language mail.
Comment 1 John Hardin 2019-03-01 20:44:33 UTC
You will probably get better results if you report this on the SpamAssassin Users mailing list. You will have to subscribe before you can successfully post.
Comment 2 Bill Cole 2019-03-20 22:21:53 UTC
This was discussed on the mailing list. See https://mail-archives.apache.org/mod_mbox/spamassassin-users/201903.mbox/%3cDE6D0718-18A5-45FE-99CF-F4BF493E7E05@billmail.scconsult.com%3e

Short version:

These additions have been committed to the rules:

     lang pl  score MIXED_ES  0.01
     lang cz  score MIXED_ES  0.01
     lang sk  score MIXED_ES  0.01
     lang hr  score MIXED_ES  0.01
     lang el  score MIXED_ES  0.01

Also, the score of the rule as calculated by RuleQA has declined by over 1 point in the past month, mitigating the effects of the rule matching ham.
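
For sites that have not yet picked up the updated rules, a similar effect can probably be had with a local override. The fragment below is only a sketch: the use of a site-wide local.cf and the reuse of 0.01 as the local value are assumptions, not part of the commit. As far as the Mail::SpamAssassin::Conf documentation describes it, a "lang xx" prefix makes a line apply only when SpamAssassin runs under that locale, so the unconditional form is likely the more practical one on a server whose locale is not one of those listed above:

     # local.cf (hypothetical site-wide override, not part of the committed rules)
     # Lower MIXED_ES for all mail, regardless of the locale SpamAssassin runs under.
     score MIXED_ES 0.01

Running "spamassassin --lint" afterwards should confirm that the configuration still parses.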