The Grumpy Editor's guide to bayesian spam filters
Posted Feb 23, 2006 9:42 UTC (Thu) by walterh
Parent article: The Grumpy Editor's guide to bayesian spam filters
I think that running SpamAssassin with the network tests enabled is unfair and invalidates the statistics. After all, your mail was collected over some time and only then fed to SpamAssassin. In the meantime, the network databases that SpamAssassin queries had already had the spam from your set reported by other users. This is very different from the situation where you pipe your incoming mail through SpamAssassin in real time, because at that point most new spam hasn't been reported yet.
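To make the timing argument concrete, here is a toy sketch in Python. It does not call SpamAssassin at all; the `Spam` class, the `caught_by_network_tests` helper, and the six-hour reporting delay are all made up for illustration. It only shows how the same lookup gives different hit counts depending on when it is run.

```python
from dataclasses import dataclass


@dataclass
class Spam:
    arrived: float  # hour the message reached the inbox
    listed: float   # hour a network database first flagged it


def caught_by_network_tests(messages, checked_at):
    """Count messages that are already listed at the time each one is checked."""
    return sum(1 for m in messages if m.listed <= checked_at(m))


# Hypothetical corpus: one message every 10 hours, each reported
# to the network databases 6 hours after it arrived.
corpus = [Spam(arrived=h, listed=h + 6) for h in range(0, 100, 10)]

# Real-time filtering: each message is checked the moment it arrives.
real_time = caught_by_network_tests(corpus, lambda m: m.arrived)

# After-the-fact testing: the whole corpus is checked long after collection.
after_the_fact = caught_by_network_tests(corpus, lambda m: 1000)

print(real_time, after_the_fact)
```

In this made-up setup the real-time check catches nothing while the delayed check catches everything. Real reporting delays vary, so the true gap would be smaller, but the direction of the bias is the same.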