Spamassassin sa-learn Missing Messages
OS: 9.0 SA Version 2.55 (downloaded from SuSE archives) I fed sa-learn a single file with over 6000 messages but the output indicated it only accepted one message! I fed it a directory full of ham containing many thousands of messages but it only accepted 426! I tried creating a new spam.log file containing a few instances of know spam but it accepted non of them. My MTA is postfix 2.0 and I am invoking spamassassin via the anomy script file. There is, of course, no bayesian filtering taking place. I have scanned to SA docs and find nothing about mail file formats it works/doesn't work with. Any suggestions? Thank you, Lucky Leavell
I figured this one out: use --mbox instead of --file or --dir. On Mon, 10 May 2004, Lucky Leavell wrote:
OS: 9.0 SA Version 2.55 (downloaded from SuSE archives)
I fed sa-learn a single file with over 6000 messages but the output indicated it only accepted one message!
I fed it a directory full of ham containing many thousands of messages but it only accepted 426! I tried creating a new spam.log file containing a few instances of know spam but it accepted non of them.
My MTA is postfix 2.0 and I am invoking spamassassin via the anomy script file. There is, of course, no bayesian filtering taking place.
I have scanned to SA docs and find nothing about mail file formats it works/doesn't work with.
Any suggestions?
Thank you, Lucky Leavell
-- Check the headers for your unsubscription address For additional commands send e-mail to suse-linux-e-help@suse.com Also check the archives at http://lists.suse.com Please read the FAQs: suse-linux-e-faq@suse.com
OS: 9.0 SA Version: 2.55 (from SuSE archive) I have ample ham and spam counts (6,000 spam, 38,000 ham) but SA is still not using Bayesian analysis. I did manually set use_bayes to 1 in /etc/mail/spamassassin/local.cf though it is supposed to be the default but still no luck (based on the X-Spam-Status tests= line(s) in the message headers). Aside: I managed to get bogofilter working and use it via procmail but that is after SA processing. Thank you, Lucky
Lucky Leavell wrote:
OS: 9.0 SA Version: 2.55 (from SuSE archive)
I have ample ham and spam counts (6,000 spam, 38,000 ham) but SA is still not using Bayesian analysis. I did manually set use_bayes to 1 in /etc/mail/spamassassin/local.cf though it is supposed to be the default but still no luck (based on the X-Spam-Status tests= line(s) in the message headers).
Aside: I managed to get bogofilter working and use it via procmail but that is after SA processing.
Thank you, Lucky
Have you set a path to where you'd like the system-wide bayes database? Just as an example: bayes_path /home/mybayes You'll soon see whether the database is created and used. Second, you need to tell spamassassin what kind of maildir you have. I use a single file rather than a folder with separate messages, so I have to say: sa-learn --ham --mbox /path/to/ham sa-learn --spam --mbox /path/to/spam After that I run sa-learn rebuild. Admittedly, I've had to install a lot of third-party rulesets to keep the old spam detection rates high and upgrade to 2.63, but spamassassin has been working on tiptop form for a long time now and my only problem is the occasional false positive, usually from this list. :) Fish
On Tue, 11 May 2004, Mark wrote:
Lucky Leavell wrote:
OS: 9.0 SA Version: 2.55 (from SuSE archive)
I have ample ham and spam counts (6,000 spam, 38,000 ham) but SA is still not using Bayesian analysis. I did manually set use_bayes to 1 in /etc/mail/spamassassin/local.cf though it is supposed to be the default but still no luck (based on the X-Spam-Status tests= line(s) in the message headers).
Aside: I managed to get bogofilter working and use it via procmail but that is after SA processing.
Thank you, Lucky
Have you set a path to where you'd like the system-wide bayes database?
Just as an example: bayes_path /home/mybayes
This is probably it. I have only run sa-learn for my user account and only intend to run it for accounts with sufficient ham/spam history.
You'll soon see whether the database is created and used.
My database is in ~/.spamassassin/
Second, you need to tell spamassassin what kind of maildir you have. I use a single file rather than a folder with separate messages, so I have to say:
sa-learn --ham --mbox /path/to/ham sa-learn --spam --mbox /path/to/spam
I finally figured this one out after sa-learn reported it processed one message instead of the 6000 or so in the file!
After that I run sa-learn rebuild.
Did that...
Admittedly, I've had to install a lot of third-party rulesets to keep the old spam detection rates high and upgrade to 2.63, but spamassassin has been working on tiptop form for a long time now and my only problem is the occasional false positive, usually from this list.
Still using 2.55 downloaded from SuSE with default rulesets. I am far more concerned about false positives than an occasional missed spam message. Fortunately, most of these have been noncritical. Thank you, Lucky
Lucky Leavell wrote: [snip]
My database is in ~/.spamassassin/
[snip] Ah, well my understanding is that the bayes_path set in /etc/mail/spamassassin/local.cf is for a site-wde bayes db. If you are implementing it on a per user basis, then I guess you'll need to do something else. I don't know what as I only use a single, site-wide bayes db. Apologies, should have taken that into account when replying. :) Fish
You can use bogofilter directly in K-mail: one filter to add an extra header X-Bogosity: criterion: <size> > 1 action: pipe to bogofilter -ep and one filter to send the spam to trash: criterion: X-Bogosity contains yes action: move to folder trash bogofilter works like a charm with as few as 500 ham and 200 spam messages. Best regards, Jos. On Tuesday 11 May 2004 16:33, Lucky Leavell wrote:
OS: 9.0 SA Version: 2.55 (from SuSE archive)
I have ample ham and spam counts (6,000 spam, 38,000 ham) but SA is still not using Bayesian analysis. I did manually set use_bayes to 1 in /etc/mail/spamassassin/local.cf though it is supposed to be the default but still no luck (based on the X-Spam-Status tests= line(s) in the message headers).
Aside: I managed to get bogofilter working and use it via procmail but that is after SA processing.
Thank you, Lucky
-- Jos van Kan Reginastraat 1 2645 NM Delfgauw 015 2615 267
What about multiple pop accounts in Evolution?
CWSIV
On Wed, 12 May 2004 01:15:15 +0200 Jos van Kan
You can use bogofilter directly in K-mail: one filter to add an extra header X-Bogosity: criterion: <size> > 1 action: pipe to bogofilter -ep
and one filter to send the spam to trash: criterion: X-Bogosity contains yes action: move to folder trash
bogofilter works like a charm with as few as 500 ham and 200 spam messages.
Best regards, Jos.
On Tuesday 11 May 2004 16:33, Lucky Leavell wrote:
OS: 9.0 SA Version: 2.55 (from SuSE archive)
________________________________________________________________ The best thing to hit the Internet in years - Juno SpeedBand! Surf the Web up to FIVE TIMES FASTER! Only $14.95/ month - visit www.juno.com to sign up today!
On Tue, 2004-05-11 at 16:33, Lucky Leavell wrote:
OS: 9.0 SA Version: 2.55 (from SuSE archive)
I have ample ham and spam counts (6,000 spam, 38,000 ham) but SA is still not using Bayesian analysis. I did manually set use_bayes to 1 in /etc/mail/spamassassin/local.cf though it is supposed to be the default but still no luck (based on the X-Spam-Status tests= line(s) in the message headers).
I don't know if this is the problem, but I have found that with the later versions of Spamassassin, you _have_ to first teach it before bayes will work. I found this in SUSE 9.0 with default sa 2.55, and sa 2.63, as well as on Debian. I think I had to feed it around 200 spam and 200 ham. For ham I used posts from this lists, spam I had enough of. If you want I can send you my bayes files, it's about 400k bzipp2ed.
Aside: I managed to get bogofilter working and use it via procmail but that is after SA processing. would you please send me the relevant lines from your (.)procmailrc?
Thanks -- Hans du Plooy Newington Consulting Services hansdp at newingtoncs dot co dot za
participants (5)
-
Carl William Spitzer IV
-
Hans du Plooy
-
Jos van Kan
-
Lucky Leavell
-
Mark