Bug 3045 - need a clean-text test of all text attachments
Summary: need a clean-text test of all text attachments
Status: RESOLVED DUPLICATE of bug 1527
Alias: None
Product: Spamassassin
Classification: Unclassified
Component: Rules (show other bugs)
Version: 2.63
Hardware: Other other
: P5 normal
Target Milestone: 3.0.0
Assignee: SpamAssassin Developer Mailing List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2004-02-13 16:11 UTC by David Muir Sharnoff
Modified: 2004-03-06 08:46 UTC (History)
0 users



Attachment Type Modified Status Actions Submitter/CLA Status
my attempted rules text/plain None David Muir Sharnoff [NoCLA]
example spam text/plain None David Muir Sharnoff [NoCLA]

Note You need to log in before you can comment on or make changes to this bug.
Description David Muir Sharnoff 2004-02-13 16:11:33 UTC
In a document with multiple text parts, I need a rule that can 
match against the cleaned-up text in all the parts. 
 
The document I'm matching uses quoted-printable encoding.  I'm 
trying to match something in the 2nd text attachement.  I haven't 
found a way to do that where the text has been cleaned up from the 
quoted-printable mess. 
 
This is important because line breaks can be inserted into quoted 
printable text at any point. 
 
For those that are curious, what I'm trying to catch are the clever 
spammers who send an html document that has the text slightly encrypted 
and a javascript program (always the same) that decrypts the text. 
 
I'll attach both the rules I'm trying and an example of the spam. 
 
-Dave
Comment 1 David Muir Sharnoff 2004-02-13 16:12:20 UTC
Created attachment 1774 [details]
my attempted rules
Comment 2 David Muir Sharnoff 2004-02-13 16:12:44 UTC
Created attachment 1775 [details]
example spam
Comment 3 Daniel Quinlan 2004-02-14 14:04:53 UTC
Can you precisely define "the cleaned-up text"?
Comment 4 David Muir Sharnoff 2004-02-14 14:43:52 UTC
By "cleaned up text" I mean undoing the mime-encoding. 
 
Quoted-printable is particularly annoying because you can have 
line-breaks at any point so trying to match something that is  
quoted-printable encoded is nearly impossible. 
 
Comment 5 Theo Van Dinter 2004-02-15 15:33:17 UTC
Can you test out how 3.0.0 works for you?  It has a new MIME parser which should 
handle all of the "what do I decode?" issues.

If 3.0.0 solves the issue, this ticket can be closed as a dup of 1527.
Comment 6 Theo Van Dinter 2004-03-06 17:46:39 UTC
hearing nothing, I'm going to say that 3.0.0 should take care of the issue.

*** This bug has been marked as a duplicate of 1527 ***