LWN.net Logo

The Grumpy Editor's guide to bayesian spam filters

The Grumpy Editor's guide to bayesian spam filters

Posted Feb 23, 2006 13:54 UTC (Thu) by glouis (guest, #526)
Parent article: The Grumpy Editor's guide to bayesian spam filters

Disclaimer: I'm a bogofilter developer and the original author of the bogotune utility.
Bogofilter works fairly well out of the box, with minimal training, as you saw. With careful tuning, particularly of the spam and nonspam cutoff values, and a rather larger amount of training, results like 0.5% false negatives and <1 in 150,000 false positives are attainable. See the papers at http://www.bgl.nu/bogofilter/ for details. The work involved is nontrivial but rewarding.


(Log in to post comments)

The Grumpy Editor's guide to bayesian spam filters

Posted Feb 24, 2006 19:39 UTC (Fri) by Dom2 (guest, #458) [Link]

My main problem with bogofilter is its use of berkeley DB. Today I've finally switched to sqlite instead and I hope that this will stop the seemingly neverending database corruptions that I kept experiencing. Apart from that, bogofilter's a pretty useful little tool.

-Dom

Copyright © 2013, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds