Another paper
Posted Sep 12, 2002 13:17 UTC (Thu) by jrennie (guest, #3655)
In reply to: Another paper by armijn
Parent article: Spam avoidance techniques
FYI, k-nearest neighbors (kNN) is very slow compared to filtering by rules or by Bayesian approaches (such as the one Graham describes, bogofilter, and ifile). For each message you want filtered, kNN compares that message to every message in the training database. So, filtering n messages is O(nm), where m is the number of training messages. Bayesian approaches scale as O(n): the cost of filtering one message depends on the words in that message, not on how many training messages there are.
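To make the scaling difference concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the function names, the toy training set, the Jaccard word-overlap similarity for kNN, and the add-one smoothing for the Bayesian filter. It is not how ifile, bogofilter, or Graham's filter actually work; it only shows where the loop over the m training messages appears in one approach and not the other.

```python
import math
from collections import Counter

# Toy training set (hypothetical data, for illustration only).
TRAINING = [
    ("buy cheap pills now", "spam"),
    ("cheap pills cheap offer", "spam"),
    ("win money now", "spam"),
    ("meeting notes attached", "ham"),
    ("lunch meeting tomorrow", "ham"),
    ("kernel patch review notes", "ham"),
]


def tokens(text):
    """Split a message into lowercase words."""
    return text.lower().split()


def knn_classify(message, training, k=3):
    """Brute-force kNN: one similarity computation per training message."""
    words = set(tokens(message))
    scored = []
    for text, label in training:  # this loop is the factor of m
        other = set(tokens(text))
        union = words | other
        similarity = len(words & other) / len(union) if union else 0.0
        scored.append((similarity, label))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


def train_bayes(training):
    """One pass over the training set, done once, before any filtering."""
    word_counts = {}
    doc_counts = Counter()
    for text, label in training:
        word_counts.setdefault(label, Counter()).update(tokens(text))
        doc_counts[label] += 1
    vocabulary = set()
    for counts in word_counts.values():
        vocabulary.update(counts)
    totals = {label: sum(c.values()) for label, c in word_counts.items()}
    return word_counts, doc_counts, totals, len(vocabulary)


def bayes_classify(message, model):
    """Naive Bayes: cost depends on the message's words, not on m."""
    word_counts, doc_counts, totals, vocab_size = model
    n_docs = sum(doc_counts.values())
    n_labels = len(doc_counts)
    best_label, best_score = None, -math.inf
    for label, counts in word_counts.items():
        # Add-one smoothing on both the prior and the word likelihoods.
        score = math.log((doc_counts[label] + 1) / (n_docs + n_labels))
        for word in tokens(message):
            score += math.log((counts[word] + 1) / (totals[label] + vocab_size))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train_bayes(TRAINING)
for msg in ("cheap pills offer", "lunch meeting notes"):
    print(msg, "->", knn_classify(msg, TRAINING), bayes_classify(msg, model))
```

Note that `knn_classify` has to be handed the whole training set on every call, while `bayes_classify` only consults the precomputed counts. Doubling the training set doubles the work per message for the former and leaves the latter unchanged.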
Jason Rennie
Author of ifile - the original intelligent e-mail filter
