SA Bugzilla – Bug 1934
Catch repeated obfuscating comments
Last modified: 2021-04-18 09:39:25 UTC
# Notes: 6 is an arbitrary number. Fewer may work better.
# Also, if there are false positives, we might want to pair this with
# OBFUSCATING_COMMENT in a meta test, but I don't think it's needed.
rawbody RepeatedComment /<\!([^>]*)>.+<\!\1>.+<\!\1>.+<\!\1>.+<\!\1>.+<\!\1>/is
score RepeatedComment 2
describe RepeatedComment Same HTML comment was repeated 6+ times
Subject: Re: [SAdev] New: Catch repeated obfuscating comments

> Notes: 6 is an arbitrary number. Less may work better. Also, if false
> positives, we might want to pair this with OBFUSCATING_COMMENT in a
> meta test, but I don't think it's needed

Very interesting idea! We should also try putting some Perl code in HTML.pm to catch repeated comments. Try both repeated in sequence and repeated throughout the entire message. It will probably be cheaper and perhaps more accurate than using a backtracking rawbody test.

Daniel
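A minimal sketch of the counting approach suggested above, written in Python purely for illustration (it is not the Perl that would live in HTML.pm, and nothing here is existing SpamAssassin code). The function name `repeated_comment_stats` is hypothetical. It makes a single pass over the message instead of relying on regex backtracking, and reports both variants: the most frequent comment across the whole message and the longest run of identical comments in sequence. Lowercasing the comment body is an assumption meant to approximate the /i flag on the rawbody rule.

```python
import re
from collections import Counter

# Matches <!...> tags, the same shape the rawbody rule looks for.
COMMENT_RE = re.compile(r"<!([^>]*)>", re.S)


def repeated_comment_stats(html):
    """Return (max_total, max_run).

    max_total: highest count of any single comment body in the message.
    max_run:   longest run of identical bodies among consecutive comments
               (other text may sit between the comments).
    """
    bodies = [m.group(1).lower() for m in COMMENT_RE.finditer(html)]
    if not bodies:
        return 0, 0

    max_total = max(Counter(bodies).values())

    max_run = run = 1
    for prev, cur in zip(bodies, bodies[1:]):
        run = run + 1 if cur == prev else 1
        max_run = max(max_run, run)

    return max_total, max_run
```

A rule built on this could fire when either number reaches a chosen threshold; the 6 from the original rule would be one starting point to test against a corpus.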
I suppose we should remove the !s from my test to match the new-style spam I have seen, which formats comments <like this>. Also, I like the idea of coding this test in Perl. Other ideas are to count comments consisting of one long word, or comments with random non-letter characters in them.
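The two extra ideas above (one-long-word comments and comments full of non-letter characters) could be sketched as follows, again in Python for illustration only. All names and both thresholds (`LONG_WORD_MIN`, `NON_LETTER_RATIO`) are hypothetical and would need tuning against real mail; this version only looks at standard `<!-- ... -->` comments.

```python
import re

COMMENT_RE = re.compile(r"<!--(.*?)-->", re.S)

LONG_WORD_MIN = 12      # hypothetical: minimum length of a "long word"
NON_LETTER_RATIO = 0.5  # hypothetical: share of non-letters that counts as noisy


def is_one_long_word(body):
    """True if the comment is a single unbroken alphanumeric token."""
    text = body.strip()
    return len(text) >= LONG_WORD_MIN and text.isalnum()


def is_mostly_non_letters(body):
    """True if at least NON_LETTER_RATIO of the non-space characters are not letters."""
    text = "".join(body.split())
    if not text:
        return False
    non_letters = sum(1 for ch in text if not ch.isalpha())
    return non_letters / len(text) >= NON_LETTER_RATIO


def count_suspicious_comments(html):
    """Return (long_word_comments, noisy_comments) for the message."""
    long_words = noisy = 0
    for match in COMMENT_RE.finditer(html):
        body = match.group(1)
        if is_one_long_word(body):
            long_words += 1
        if is_mostly_non_letters(body):
            noisy += 1
    return long_words, noisy
```

Either count could feed a meta test alongside the repeated-comment check rather than scoring on its own, since legitimate HTML sometimes contains odd-looking comments too.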
move bug to Future milestone (previously set to Future -- I hope)
Checked this out for fun. It doesn't make any sense with today's HTML mail. Closing.