(statistically) biased tests?
Posted Sep 12, 2002 17:49 UTC (Thu) by bockman (guest, #3650)
Parent article: Spam avoidance techniques
From the little I know of statistical filters, if you train a filter on a set
of data, you should not use that same set to evaluate how good the trained filter is (since you are testing on the training data, the
filter will obviously show good results).
A better test would be to train the filter on half of the data set and then test it on the other half.
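
To make the point concrete, here is a rough sketch in Python of the half-and-half test. It is not the filter from the article: `split_half`, `MemorizingFilter`, `accuracy` and the tiny corpus are all names and data I made up for illustration. The toy filter only remembers the exact messages it was trained on, which is the extreme case of a filter that looks perfect on its own training data.

```python
import random


def split_half(messages, seed=0):
    """Shuffle a copy of the labelled messages and cut it into two halves."""
    shuffled = list(messages)
    random.Random(seed).shuffle(shuffled)
    middle = len(shuffled) // 2
    return shuffled[:middle], shuffled[middle:]


class MemorizingFilter:
    """Hypothetical toy filter: it only recalls messages seen in training."""

    def train(self, labelled):
        # Map each training text to its label (True means spam).
        self.seen = dict(labelled)

    def is_spam(self, text):
        # Anything never seen before is assumed to be legitimate mail.
        return self.seen.get(text, False)


def accuracy(flt, labelled):
    """Fraction of messages whose predicted label matches the true label."""
    correct = sum(1 for text, label in labelled if flt.is_spam(text) == label)
    return correct / len(labelled)


# Made-up corpus of (text, is_spam) pairs; every text is unique.
corpus = [("buy pills %d" % i, True) for i in range(10)] + \
         [("meeting notes %d" % i, False) for i in range(10)]

train_set, test_set = split_half(corpus)

flt = MemorizingFilter()
flt.train(train_set)

train_accuracy = accuracy(flt, train_set)  # scored on data it has already seen
test_accuracy = accuracy(flt, test_set)    # scored on the held-out half

print("accuracy on training half:", train_accuracy)
print("accuracy on held-out half:", test_accuracy)
```

The score on the training half is perfect by construction, since the filter is just looking up answers it was given. Only the score on the held-out half says anything about how it would handle new mail.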