using libmilter, milter-spamc and spamassassin

Scot L. Harris webid at cfl.rr.com
Sat May 29 19:51:21 UTC 2004


On Sat, 2004-05-29 at 15:03, Hannes Mayer wrote:

> 
> Scot, so the bayesian filter doesn't learn automatically ?
> I mean, one needs to train it once again with the filtered spam ?
> Any details you can share about this are greatly appreciated!
> 
> Thank you,
> Hannes.

There is an auto-learn mode, which does part of the job.  From what I
have read and from my own experience, it is a good idea to run identified
spam as well as ham messages through the sa-learn program to reinforce
what it learns automatically.  You will also need to teach it, as spam,
any spam messages that get through.

Also, the Bayesian filtering does not actually start until the system has
learned from several hundred messages.  Feeding it a good-sized sample of
ham messages as well as identified spam messages reinforces what you
consider spam and ham.  It seems to refine the identification process
very well.
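
To see whether the database has reached that point, the message counts
can be read from the output of "sa-learn --dump magic".  The following is
only a rough sketch in Python.  It assumes the dump lines end in
"non-token data: nspam" and "non-token data: nham" with the count in the
third column, and it assumes thresholds of 200 spam and 200 ham (believed
to be the defaults for bayes_min_spam_num and bayes_min_ham_num, so check
your own configuration).  The function names are made up for illustration.

```python
# Assumed thresholds; verify bayes_min_spam_num / bayes_min_ham_num locally.
MIN_SPAM = 200
MIN_HAM = 200


def parse_counts(dump_text):
    """Pull nspam/nham out of text shaped like `sa-learn --dump magic` output.

    Assumes each relevant line looks like:
        0.000   0   9499   0  non-token data: nspam
    with the message count in the third whitespace-separated column.
    """
    counts = {}
    for line in dump_text.splitlines():
        fields = line.split()
        if len(fields) >= 7 and fields[-1] in ("nspam", "nham"):
            counts[fields[-1]] = int(fields[2])
    return counts


def bayes_ready(counts, min_spam=MIN_SPAM, min_ham=MIN_HAM):
    """True once both the spam and ham counts meet the assumed thresholds."""
    return counts.get("nspam", 0) >= min_spam and counts.get("nham", 0) >= min_ham
```

To use it, capture the dump text (for example by running the command by
hand and saving the output) and pass it to parse_counts, then hand the
result to bayes_ready.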

I dump identified spam to a holding folder and unflagged spam to a
separate folder.  Then once a week or so I run sa-learn against those
folders and against my inbox, which contains only ham messages.
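
That weekly run could be scripted along these lines.  This is a minimal
sketch in Python, not an exact record of any particular setup: the folder
paths are placeholders, the function names are hypothetical, and it
assumes the folders are directories of individual messages.  For mbox
files, sa-learn would also need its --mbox option.

```python
import subprocess


def build_learn_commands(caught_spam, missed_spam, ham_inbox):
    """Return the sa-learn command lines for one training pass.

    All three arguments are placeholder paths supplied by the caller.
    """
    return [
        ["sa-learn", "--spam", caught_spam],  # spam the filter already flagged
        ["sa-learn", "--spam", missed_spam],  # spam that got through unflagged
        ["sa-learn", "--ham", ham_inbox],     # inbox holding only ham
    ]


def run_weekly(commands, dry_run=True):
    """Print each command, or actually run it when dry_run is False."""
    for argv in commands:
        if dry_run:
            print(" ".join(argv))
        else:
            subprocess.run(argv, check=True)
```

Calling run_weekly(build_learn_commands(...)) with the default dry_run
only prints the commands, which is a safe way to check the paths before
letting it touch the Bayes database.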

I have been doing this for a while.  Currently, at most a handful of
spam messages get through each week.  I have not had any false positives
in a very long time.

According to the docs, feeding it a steady diet of ham and spam will keep
spamassassin happy and help it keep up with new tricks that the spammers
try.

 
-- 
Scot L. Harris <webid at cfl.rr.com>




