Bug 1147 - Malformed UTF-8 character
Summary: Malformed UTF-8 character
Status: RESOLVED FIXED
Alias: None
Product: Spamassassin
Classification: Unclassified
Component: Rules (Eval Tests) (show other bugs)
Version: 2.43
Hardware: PC Linux
: P1 normal
Target Milestone: ---
Assignee: SpamAssassin Developer Mailing List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2002-10-23 20:59 UTC by Roberto
Modified: 2002-10-29 22:34 UTC (History)
1 user (show)



Attachment Type Modified Status Actions Submitter/CLA Status

Note You need to log in before you can comment on or make changes to this bug.
Description Roberto 2002-10-23 20:59:08 UTC
I have installed SA 2.43 and I am noticing a lot of:
Oct 23 23:32:19 filter1 spamd[16430]: Malformed UTF-8 character (unexpected 
continuation byte 0xa0, with no preceding start byte) in substitution iterator 
at /usr/local/lib/perl5/site_perl/5.8.0/Mail/SpamAssassin/EvalTests.pm line 
1706, <STDIN> line 348.  
Oct 23 23:32:19 filter1 spamd[16416]: Malformed UTF-8 character (unexpected 
continuation byte 0xa0, with no preceding start byte) in substitution iterator 
at /usr/local/lib/perl5/site_perl/5.8.0/Mail/SpamAssassin/EvalTests.pm line 
1706, <STDIN> line 160.  

SA is catching a lot of spam but this error is causing problems.

A cached faq page on the SA site says it is not a bug to worry about however 
Exim errors out when this occurs:

2002-10-23 23:45:50 184YvD-0003rs-00 failed to read delivery status for 
BOB@foo.com from delivery subprocess

RedHat 8
perl 5.8 (reinstalled to remove threading)
SA 2.43

Other than this problem SA is working wonderfully I hope there is a quick patch 
avaialable
Comment 1 lee 2002-10-24 11:16:24 UTC
I have seen this same behavior using spamassassin 2.50(devel) under mimedefang
with the same OS/perl version.
Comment 2 lee 2002-10-24 14:45:10 UTC
TEMPORARY WORKAROUND:

You can change your locale to regular en_US (en_US.iso885915) instead of
en_US.UTF-8 and this seems to fix the problem.  RedHat 8 defaults to using UTF-8
locales.  You can use locale_config if you have XFree86 installed, or you can
just edit your /etc/sysconfig/i18n file.
Comment 3 Justin Mason 2002-10-30 07:20:45 UTC
I've thrown in some pack()s into the code to de-UTF strings
before using tr///.  That should help.  now in CVS...
Comment 4 Justin Mason 2002-10-30 07:34:37 UTC
oh BTW, marking as FIXED.  If you run into more, hjust reply and this
bug will reactivate.