Bug 3279 - spamd crashing?
Summary: spamd crashing?
Status: RESOLVED DUPLICATE of bug 3317
Alias: None
Product: Spamassassin
Classification: Unclassified
Component: spamc/spamd (show other bugs)
Version: SVN Trunk (Latest Devel Version)
Hardware: Other other
: P5 normal
Target Milestone: 3.0.0
Assignee: Theo Van Dinter
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: 3208
  Show dependency tree
 
Reported: 2004-04-17 03:06 UTC by Graham Murray
Modified: 2004-04-28 15:15 UTC (History)
1 user (show)



Attachment Type Modified Status Actions Submitter/CLA Status

Note You need to log in before you can comment on or make changes to this bug.
Description Graham Murray 2004-04-17 03:06:21 UTC
Since upgrading to 10051, I am seeing a lot of mails which do not have
spamassassin headers added. These are being passed to spamd by spamc, but
processing is not completing.  In each case that I have lokked at, the spamd
debug output stops after 'received-header: parsed as'. An example from the -D
output (this is all the output for this email) is

2004-04-17 09:35:34 [28354] i: got connection over /tmp/spamd.sock
2004-04-17 09:35:35 [28354] i: logmsg: info: setuid to graham succeeded
2004-04-17 09:35:35 [28354] i: info: setuid to graham succeeded
2004-04-17 09:35:37 [28354] i: debug: user has changed
2004-04-17 09:35:38 [28354] i: debug: using "/home/graham/.spamassassin" for
user state dir
2004-04-17 09:35:38 [28354] i: debug: bayes: 28354 tie-ing to DB file R/O
/home/graham/.spamassassin/bayes_toks
2004-04-17 09:35:39 [28354] i: debug: bayes: 28354 tie-ing to DB file R/O
/home/graham/.spamassassin/bayes_seen
2004-04-17 09:35:39 [28354] i: debug: bayes: found bayes db version 2
2004-04-17 09:35:44 [28354] i: debug: Score set 3 chosen.
2004-04-17 09:35:45 [28354] i: logmsg: processing message
<200404170935.i3H9ZT3N028755@home.gmurray.org.uk> for graham:500.
2004-04-17 09:35:45 [28354] i: processing message
<200404170935.i3H9ZT3N028755@home.gmurray.org.uk> for graham:500.
2004-04-17 09:35:46 [28354] i: debug: ---- MIME PARSER START ----
2004-04-17 09:35:46 [28354] i: debug: main message type: text/html
2004-04-17 09:35:46 [28354] i: debug: parsing normal part
2004-04-17 09:35:46 [28354] i: debug: added part, type: text/html
2004-04-17 09:35:46 [28354] i: debug: ---- MIME PARSER END ----
2004-04-17 09:35:49 [28354] i: debug: received-header: parsed as [
ip=217.169.20.13 rdns=A.hopeless.aaisp.net.uk helo=hopeless.aaisp.net.uk
by=home.gmurray.org.uk ident= envfrom= intl=0 ]
2004-04-17 09:35:52 [28354] i: debug: is Net::DNS::Resolver available? yes
2004-04-17 09:35:54 [28354] i: debug: looking up PTR record for '207.182.148.82'
2004-04-17 09:35:58 [28354] i: debug: PTR for '207.182.148.82':
'mail01-02.allright22ok33.com'
2004-04-17 09:35:58 [28354] i: debug: received-header: parsed as [
ip=207.182.148.82 rdns=mail01-02.allright22ok33.com
helo=mail01-02.allright22ok33.com!!207.182.148.82! by=hopeless.aaisp.net.uk
ident= envfrom= intl=0 ]
Comment 1 Dallas Engelken 2004-04-17 10:35:27 UTC
this looks identical to the problem i was seeing in 2.63 that i was not having
in 3.0 last i tested (r9952)

from the sa-talk list a couple days ago...

################
I'm having a problem with mail containing the following received header
from ever processing.   It takes 70-80 seconds for SA 2.63 to process it
which exceeds by spamc -t60 therefore it exits 74 and tempfails it.
This happens over and over all day long.   Testing this through 3.0-SVN
the message scans in just a couple seconds...  Is there an easy fix for this on
2.63, besides enabling -L for spamd (which works).

FROM 2.63
--------------------

# cat /tmp/message
Received: from e2kdfwmcs1.fhhlc.ftn.com ([10.40.34.13]) by 
e2kmemsmtp1.ftbco.ftn.com with Microsoft SMTPSVC(5.0.2195.6713); Thu,  15 Apr
2004 11:27:07 -0500

Hi

[root@mailgw tmp]# spamassassin -V
SpamAssassin version 2.63

[root@mailgw tmp]# time spamc -t60 < /tmp/message
Received: from e2kdfwmcs1.fhhlc.ftn.com ([10.40.34.13]) by 
e2kmemsmtp1.ftbco.ftn.com with Microsoft SMTPSVC(5.0.2195.6713); Thu,  15 Apr
2004 11:27:07 -0500

Hi

real    1m46.338s
user    0m0.000s
sys     0m0.000s

FROM 3.0 SVN
--------------------

[root@spamd4 tmp]# spamassassin -V
SpamAssassin version 3.0.0-r9952

[root@spamd4 tmp]# time spamc -t60 < /tmp/message
Received: from e2kdfwmcs1.fhhlc.ftn.com ([10.40.34.13]) by 
e2kmemsmtp1.ftbco.ftn.com with Microsoft SMTPSVC(5.0.2195.6713); Thu,  15 Apr
2004 11:27:07 -0500
X-Spam-Level: **
X-Spam-Status: No, hits=2.9 required=5.0

Hi

real    0m3.819s
user    0m0.000s
sys     0m0.000s

--------------------

when i was seeing the problem, I believe it is related to a non-existent domain
or a domain that doesnt not have DNS servers answering requests, therefore spamd
was waiting for a DNS timeout from the OS..  

At least when I was experiencing the problem with my received header hanging (by
watching spamd debug log), i could not do any lookups on NS's for ftn.com,
nmap'ing port 53udp on their NS's said filtered.  It looks like they might have
a valid NS in there now.

Is putting an eval {};  on the Received header DNS lookups in order to associate
a user-defined timeout worth having?   We have razor_timeout, pyzor_timeout,
dcc_timeout,  why not a dns_timeout?

 
Comment 2 Graham Murray 2004-04-18 02:02:13 UTC
I do not think it is a DNS lookup timeout problem. The latest example only took
28s from the time of the incoming connection to sendmail to the mail being
delivered by procmail.

I only started seeing this with the spamd which pre-forks. I never saw it with
the previous 3.0.0 versions of spamd.
Comment 3 Theo Van Dinter 2004-04-18 13:48:43 UTC
does the same message repeatedly have this behavior?
Comment 4 Graham Murray 2004-04-18 22:47:11 UTC
I saved one of the messages showing this problem and ran it through spamc
manually. I did this 5 times, 2 times it came back with X-SPAM headers, 3 times
it came back with no added headers.

Also looking at another message which failed, it is not always stopping at the
'Received-header parsed", At least one message stopped after "Running tests for
priority: 500" but again I do not think this is a timeout as the message was
only a few seconds into processing.
Comment 5 Dallas Engelken 2004-04-19 05:12:59 UTC
well, that ftn.com received header is doing the same thing again on 2.63 -
testing the same message as you see above in this bug with just the single
received header line and the word 'Hi' in the body... it causes spamc to return
after 60 seconds from -t60, and spamd finally finishes the processing in 1
minute and 36 seconds as you'll see by the debug log for PID 13883...

it hangs on the 'debug: received-header:' twice as you can see, for 51 seconds
on the first, and 52 seconds on the second... sure sounds like a dns timeout of
some kind to me since they are so close.

[root@mailgw root]# time spamc -t60 < /tmp/message
Received: from e2kdfwmcs1.fhhlc.ftn.com ([10.40.34.13]) by
 e2kmemsmtp1.ftbco.ftn.com with Microsoft SMTPSVC(5.0.2195.6713); Thu,
 15 Apr 2004 11:27:07 -0500

Hi


real    1m0.007s
user    0m0.000s
sys     0m0.000s


debug info where it was hanging..
-------------------------------
..
..
2004-04-19 07:01:42.954209500 debug: bayes: 13883 untie-ing db_toks
2004-04-19 07:01:42.954339500 debug: bayes: 13883 untie-ing db_seen
2004-04-19 07:01:42.955358500 debug: received-header: parsed as [ ip=10.40.34.13
rdns=e2kdfwmcs1.fhhlc.ftn.com helo= by=e2kmemsmtp1.ftbco.ftn.com ident= ]
2004-04-19 07:02:33.989726500 debug: received-header: 'from' 10.40.34.13 has
reserved IP
2004-04-19 07:03:25.979742500 debug: received-header: relay 10.40.34.13 trusted? yes
2004-04-19 07:03:25.980736500 debug: Message too short for language analysis
..
..
2004-04-19 07:03:29.361244500 logmsg: clean message (2.9/4.0) for root:99 in
106.4 seconds, 163 bytes.
2004-04-19 07:03:29.361246500 debug: cleaned up kid 13883, pool=5
---------------------

also, if spamc detaches, why does spamd keep processing?  should the timeout be
server side on spamd instead of client side on spamc?

Comment 6 Theo Van Dinter 2004-04-25 12:40:52 UTC
taking the tickets since they're all related to the new spamd...
Comment 7 Justin Mason 2004-04-28 15:51:53 UTC
btw, would suggest attaching a sample
Comment 8 Theo Van Dinter 2004-04-28 23:15:11 UTC
this looks exactly like the 3317 bug.

*** This bug has been marked as a duplicate of 3317 ***