SA Bugzilla – Bug 2397
DISGUISE_PORN needs some additions
Last modified: 2005-01-10 16:42:10 UTC
Been noticing a few more techniques used for disguising porn words. One is using spaces between the words, and another is using FONT tags between some of the letters to try to fool filters. Of course, with the latter, the "body" tests will strip the tags so that it doesn't get fooled, but since SpamAssassin doesn't test for NORMAL dirty words, it doesn't get caught. Therefore, spam should be punished for using this technique. I'll attach an example spam and a solution in a minute...
Created attachment 1305 [details] Some perverted spam...
Created attachment 1306 [details] Some extra DISGUISE_PORN rules Not in patch form for right now. Also, I've included some eval tests with some commented Perl code. The variables aren't in the proper format, since I haven't deciphered EvalTests.pm yet, but it should be easy for somebody to convert it into the corrent variables. Also, I was going to add another one that caught words like "f uck", but I'm not sure if the "body" rule will translate HTML elements like that into a space or not. Anybody know the answer? (The documentation wasn't completely clear on that one.)
Subject: Re: [SAdev] DISGUISE_PORN needs some additions > Also, I was going to add another one that caught words like "f uck", but > I'm not sure if the "body" rule will translate HTML elements like that into a > space or not. Anybody know the answer? (The documentation wasn't completely > clear on that one.) it will.
Created attachment 1724 [details] Patch Figure I go ahead and take care of this one...
moving accuracy and some bugs to 3.1.0 milestone
more accuracy and performance bugs going to 3.1.0 milestone
reassigning to me
Brendan, Can you please submit a Contributor License Agreement to the ASF? http://www.apache.org/licenses/ Thanks. Daniel
thanks for the CLA (you gotta let me know when it's submitted so I know to check), closing bug