SA Bugzilla – Bug 3709
cannot use bayes on this message; db not initialised yet
Last modified: 2004-08-20 09:19:17 UTC
I don't think bayes filtering is occuring but I can't tell why. I've looked at the code (not a perl expert!). The sorted array is empty for some reason. I've put some dbg statements around the next statement. Do I not have enought "tokens" to be processed (see nham/nspam)? In Bayes.pm around line 1077. dbg ("before check $pw"); next if (abs($pw - 0.5) < $self->{robinson_min_prob_strength}); dbg ("after check $pw"); push (@sorted, $pw); spamassassin -D --lint returns: debug: running body-text per-line regexp tests; score so far=3.917 debug: bayes corpus size: nspam = 1094, nham = 547 debug: uri tests: Done uriRE debug: tokenize: header tokens for *F = "U*ignore D*compiling.spamassassin.taint.org D*spamassassin.taint.org D*taint.org D*org" debug: tokenize: header tokens for *m = " 1093038706 lint_rules " debug: in loop 150 debug: before check 0.82472161549557 debug: in loop 149 debug: before check 0.776288474810214 debug: in loop 148 debug: before check 0.386300929839391 debug: in loop 147 debug: before check 0.429654879773692 debug: in loop 146 debug: before check 0.4952517093434 debug: cannot use bayes on this message; db not initialised yet debug: bayes: not scoring message, returning 0.5 debug: bayes: 19548 untie-ing debug: bayes: 19548 untie-ing db_toks debug: bayes: 19548 untie-ing db_seen
"debug: cannot use bayes on this message; db not initialised yet" means that the tokens that were found in the message weren't found in your bayes db or that they weren't polarized enough to be useful, so SA can't generate a probability for the message. BTW: In 3.0, the debug message was changed to a more descriptive: cannot use bayes on this message; none of the tokens were found in the database