LWN.net Logo

Keeping spamassassin current

Keeping spamassassin current

Posted Mar 4, 2004 11:35 UTC (Thu) by larsga (guest, #2801)
Parent article: Keeping spamassassin current

Due to the volume of spam I receive (typically ~800 in one weekend) I've tried out several different solutions, in this order: spamcop, bogofilter, spamassassin, and spambayes. Spamcop turned out not to catch more than ~20%; bogofilter caught a good deal, but not enough. SpamAssassin didn't do very well for me (and also produced quite a few false positives), so I ended up ditching it. SpamBayes, however, has pretty much solved the problem for me. I have to do a little training now and then to keep it up to date, but that's about it.

In short, instead of trying to keep SpamAssassin current you might as well ditch it in favour of SpamBayes. All IMHO, of course.


(Log in to post comments)

spambayes (was Keeping spamassassin current)

Posted Mar 4, 2004 12:48 UTC (Thu) by metacircles (guest, #8895) [Link]

I've also been pretty impressed by spambayes. Since I last cleaned
out my spam folder (six days ago) I've received 8488 spams (about 43Mb), and maybe ten to twenty a day have leaked through it (I don't keep stats on them, I just train and/or delete them as I see them. It feels like about that many)

I have no idea about false positives because I lack any kind of motivation to ever check the spam folder: if people really want to get in touch with me I can only suggest they (a) write mail that looks less like spam, or (b) use a reliable contact method.

Is my spam filter false positive the sender's problem?

Posted Mar 6, 2004 1:39 UTC (Sat) by giraffedata (subscriber, #1954) [Link]

It's funny that in discussions about email systems, people quite often talk like it's a privilege to send email to them; "it's OK if it's hard to send me an email because the senders who don't play my game don't deserve to reach me."

I guess that's probably true for some recipients, but for most of us, we are probably hurt just as much by a failed email as the sender. I frequently get requests from people for help with software and documents I distribute, and my reply gets rejected because my mail server is erroneously on some black list or a name server isn't set up exactly according to convention, or something like that. And those are just the ones where the spam filter is courteous enough to send a bounce message; there are probably more I don't know about.

It irritates me if I spend 15 minutes writing an email and the recipient never gets it, but the real loser in this case is the recipient. And no, I don't go out of my way to find a way to get through the spam filter in these cases. There's just not enough in it for me.

Is my spam filter false positive the sender's problem?

Posted Mar 10, 2004 1:35 UTC (Wed) by showell (subscriber, #2929) [Link]

An interesting viewpoint but I must state the other side of the coin. In my job e-mail is out-dated but unfortunately not yet out-moded. I get no spam sitting behind a very good corporate firewall but the amount of real mail is so large that I cannot guarentee to read all of it.

I get annoyed with people who consider sending an e-mail as a way of dealing with a subject (ie if I have sent an e-mail then I have fulfilled my responsibility and have informed / handed off responsibility). No-one should consider a sent e-mail as read, even if we had zero spam on the net.

signed
Frustrated Manager

Keeping spamassassin current

Posted Mar 4, 2004 16:24 UTC (Thu) by RobSeace (subscriber, #4435) [Link]

I'll throw in another recommendation for SpamBayes... I use it on my home
E-mail address, which gets about 200 spams per day, and it works incredibly
well... And, it has a very difficult time there, because I get a lot of
particularly spammy-looking mail, which I actually WANT to receive there...
(Eg: receipts and shipping notices from online ordering places, various
mailing list posts, and even a few actual ads from businesses I really want
to see...) And, it still does a remarkable job of sorting the spam from the
non-spam... Every once in a while, it'll mess-up, but it's really pretty
rare, and on the whole it definitely makes my life a whole lot easier...

However, I have to disagree with you about bogofilter... I use that on my
work E-mail address, which gets a bit less spam (maybe 100-150 per day), and
it works just as great as SpamBayes, in my experience, if not even BETTER...
However, to be fair, it DOES have a much easier job, too: the only legit
messages I ever get at work are actual work-related messages, and never any
borderline spammy-looking messages like SpamBayes has to deal with at home...
So, perhaps that accounts for my perception of it... *shrug* All I can say
is that it seems nearly perfect under these conditions, anyway... I've
never even ONCE had a false-positive with it... And, it doesn't let many
spams ever slip through, either... And, it seems to learn INSTANTLY when
corrected, whereas my experience with SpamBayes is that it takes a while to
learn from mistakes, and may keep repeating them a few times before finally
catching on...

But, in any case, I recommend both SpamBayes and bogofilter quite highly...
People keep talking about the need for all these complex and silly solutions
to the scourge of spam, when really all they need to do is install a Bayesian
filter and train it well, and they'd not have many complaints after that...
(Of course, I wouldn't object to drawing and quartering of spammers, if
that were to be passed into law, either... ;-))

Keeping spamassassin current

Posted Mar 5, 2004 1:24 UTC (Fri) by larsga (guest, #2801) [Link]

I'm not sure whether my experience with bogofilter is generally applicable, since I had to set up my own scripts to get email from my mail client into the bogofilter database. It may be that clues (headers especially) get lost on the way, so it *is* possible that bogofilter actually is better than I made it sound like.

On the other hand, when I used it *had* to set up my own scripts, whereas with SpamBayes' POP proxy there was no need for that, so SpamBayes gets some extra points anyway. :)

Copyright © 2008, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds