Keeping spamassassin current
[Posted March 3, 2004 by corbet]
Longtime users of
SpamAssassin know
that it can do an outstanding job of identifying spam. They also know,
however, that the effectiveness of any particular SpamAssassin release
tends to decline over time as spammers figure out how to craft messages
which get past the rules. The Bayesian filter buried inside SpamAssassin
can help a lot; it catches a fair amount of spam which evades the rules,
and it evolves over time to keep up with what the spammers are doing -
especially if you make a point of training the filter with its mistakes.
Even so, frustrating amounts of spam can get through.
The situation is not helped much by the fact that the SpamAssassin rule
base seems to be evolving slowly in recent times. The SpamAssassin
developers have too many other things to do, perhaps, or maybe they would
rather see the work done by the filter. In any case, some users would
certainly like to see the rules updated more frequently.
The maintenance of an up-to-the-second set of SpamAssassin rules could well
be a business opportunity for somebody, if the licensing issues could be
worked out. But SpamAssassin users should also be aware of the custom rulesets
page hosted on the SpamAssassin Wiki. This is a place where additional
rules can be found to deal with specific problems; some of them might cut
your spam load considerably.
Currently available rulesets include:
- One aimed at "pill spam." Those of us not looking to fill our
prescriptions over the net may welcome this one.
- "Bigevil" simply contains URLs found in spam; it's a sort of
content-based blacklist.
- There is a set of rules for filtering out virus warnings.
- "Tripwire" looks for combinations of letters which do not appear in
English text, normally.
Several others exist as well; there is also a "RulesDuJour" script which
can be used to automatically keep up to date with the rulesets as they are
maintained. The custom rulesets won't solve the spam problem, but they can
help to keep a mailbox a bit cleaner.
(
Log in to post comments)