Carlos E. R. wrote:
The Wednesday 2004-02-18 at 20:30 -0900, John Andersen wrote:
Anything that gets thru spamassassin but IS spam I manually move to a folder I created called missedspam
Then every midnight a cronjob runs sa-learn against that folder and then deletes the contents. That trains the bayes filters and they are getting pretty good at spotting those.
I know that, and I do that; but it is useless for this kind of spam, it is designed to fool bayesian filters. This is one of them, see how they look:
bogofilter catches these examples you sent as spam. I'm finding it a little tricky to set up, but it's showing itself to be very efficient. Even with blocks of words like that, you have to realize that it's still evident that it is spam. After all, these spam messages do not contain small words or ordinary words. They also do not contain works that I would use or receive in email. They are simply dictionary dumps. What you end up with is the assumption that everything from the dictionary is spam unless it is in the much smaller list of known good words, like suse, linux, rpm for this list.