The best of both worlds - a hybrid approach?

Posted Feb 23, 2006 10:06 UTC (Thu) by nix (subscriber, #2304)
In reply to: The best of both worlds - a hybrid approach? by corbet
Parent article: The Grumpy Editor's guide to bayesian spam filters

The developers agree. As of SA 3.2, the Bayesian scores are fixed and not trained by the perceptron.

The perceptron was persistently choosing overly low scores for the Bayesian rules. *When SA's static regex rules work well*, choosing low scores for the high-probability Bayesian learner hits does indeed minimize false positives, because genuine spam tends to hit large numbers of static regex rules as well. But those rules work less and less well as time passes after an SA release, and the perceptron, which is run on a corpus collected before the release, cannot take that into account.

Hence the hardwiring of the Bayesian scores from now on.
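
To make the argument concrete, here is a minimal sketch of the effect, not SA's actual code. It assumes SA's additive scoring model with the default threshold of 5.0; the static rule names and every score value are made up for illustration, and BAYES_99 is used only as a familiar name for a high-probability Bayesian hit.

```python
# Sketch only: additive rule scoring with hypothetical scores.
THRESHOLD = 5.0  # SA's default required score


def total_score(hits, scores):
    """Sum the scores of every rule the message hit."""
    return sum(scores[name] for name in hits)


def is_spam(hits, scores, threshold=THRESHOLD):
    """A message is flagged once its total reaches the threshold."""
    return total_score(hits, scores) >= threshold


# Hypothetical static regex rules and their scores.
STATIC = {"RULE_A": 2.0, "RULE_B": 1.9, "RULE_C": 1.5}

# Score set as the perceptron might tune it: low Bayesian score.
tuned = dict(STATIC, BAYES_99=1.0)
# Score set with a hardwired, higher Bayesian score.
fixed = dict(STATIC, BAYES_99=3.5)

# At release time a typical spam hits many static rules plus Bayes.
spam_at_release = ["RULE_A", "RULE_B", "RULE_C", "BAYES_99"]
# Months later spammers have adapted; only one static rule still fires.
spam_later = ["RULE_C", "BAYES_99"]

for label, hits in (("at release", spam_at_release), ("later", spam_later)):
    print(label,
          "tuned:", is_spam(hits, tuned),
          "fixed:", is_spam(hits, fixed))
```

With these invented numbers both score sets catch the spam at release time, so the perceptron sees no cost in the low Bayesian score. Once the static rules stop firing, only the hardwired score still gets the message over the threshold.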


Copyright © 2008, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds