The best of both worlds - a hybrid approach?
Posted Feb 23, 2006 10:06 UTC (Thu) by nix
In reply to: The best of both worlds - a hybrid approach?
Parent article: The Grumpy Editor's guide to bayesian spam filters
The developers agree. As of SA 3.2, the Bayesian scores are fixed and not trained by the perceptron.
The perceptron was persistently choosing overly low scores for the Bayesian filters, because *when SA's static regex rules work well*, choosing low scores for the high-probability Bayesian learner hits does indeed minimize FPs, as genuine spams tend to hit large numbers of static regex rules as well --- but those rules work less and less well after SA's release, and the perceptron cannot take that into account.
Hence the hardwiring of the Bayesian scores henceforward.
to post comments)