Dealing with comment spam using Bayesian logic

In the past couple of months, I've begun to notice the occasional posting of 'comment spam' on my website. These comments tend to consist of a short string of nonsense "mad-lib" style text, followed by a large number of offsite links. I currently use the tracker module to at least glance at every comment left on my website, so I eventually find this spam and manually delete it. However, as the rate of comment spam has increased, I've been looking for a better way to deal with it.

Not wanting to re-invent the wheel, I began by looking at SpamAssassin and other free anti-spam tools. I had hoped to integrate one of these tools into Drupal, letting it do the actual work of deciding whether or not a given comment was spam. With further research, I found that this wasn't very workable, as these anti-spam tools tend to be very mail-centric, looking at more than just the body of the email. Instead, I read up on using Bayesian logic, and ultimately decided it would be best to write a simple Bayesian filter in PHP.
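
To give a rough idea of what such a filter involves, here is a minimal naive Bayes sketch. It is written in Python purely for illustration and is not the PHP filter described above; the class name, the tokenizer, and the add-one smoothing are all assumptions made for this example. It counts how often each word appears in comments marked as spam or as legitimate ("ham"), then scores a new comment by how much more likely its words are under one label than the other.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase word tokens (a deliberately crude tokenizer)."""
    return re.findall(r"[a-z0-9']+", text.lower())


class NaiveBayesFilter:
    """Hypothetical two-class (spam/ham) naive Bayes comment filter."""

    def __init__(self):
        self.counts = {"spam": Counter(), "ham": Counter()}  # word counts per label
        self.docs = {"spam": 0, "ham": 0}                    # comments seen per label

    def train(self, text, label):
        """Record one comment that a human has marked 'spam' or 'ham'."""
        self.counts[label].update(tokenize(text))
        self.docs[label] += 1

    def spam_probability(self, text):
        """Return the estimated probability (0..1) that text is spam."""
        vocab_size = len(set(self.counts["spam"]) | set(self.counts["ham"]))
        total_docs = self.docs["spam"] + self.docs["ham"]
        tokens = tokenize(text)
        log_score = {}
        for label in ("spam", "ham"):
            total_tokens = sum(self.counts[label].values())
            # Smoothed prior, so an untrained label never yields log(0).
            score = math.log((self.docs[label] + 1) / (total_docs + 2))
            for tok in tokens:
                # Add-one smoothing keeps unseen words from zeroing the product.
                score += math.log(
                    (self.counts[label][tok] + 1) / (total_tokens + vocab_size + 1)
                )
            log_score[label] = score
        # Turn the two log scores into a probability; clamp to avoid overflow.
        diff = max(min(log_score["ham"] - log_score["spam"], 700.0), -700.0)
        return 1.0 / (1.0 + math.exp(diff))
```

In use, each comment deleted as spam would be passed to `train(text, "spam")` and each approved comment to `train(text, "ham")`; a new comment whose `spam_probability` exceeds some chosen threshold could then be held for review rather than published.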

Featured site: Of, By & For

Of, By and For is a Drupal site aimed at discussing ideas for political reform in the United States. The site was created by Mitch Kapor, founder of Lotus and of the Electronic Frontier Foundation; Joe Costello, who was one of the principals in the Howard Dean campaign; and Bart Decrem, who, among other things, heads marketing for the Mozilla project. This Friday, September 24, the site will host its first live event, a conversation between Joe Trippi, the campaign manager during the Dean campaign, and Mitch.

Developers: tag your themes and modules

This announcement is aimed at developers. Please take note that we have set up the DRUPAL-4-5 branch name. If you maintain a theme or module in the contributions repository, now is the time to make it compatible with Drupal 4.5. All 4.5 versions should be branched using the DRUPAL-4-5 branch name. Instructions about our CVS policy and about how to branch are available in the FAQ.

Featured site: Spread Firefox

Yesterday, September 14th, Firefox announced its 1.0 Preview Release. To celebrate that and to make the most of the momentum Firefox has gained over the last few months, a new initiative was launched: Spread Firefox. Drupal users will immediately see that this initiative is built on top of Drupal. In fact, it is built on top of CivicSpace, a distribution and service built with Drupal software and aimed at grassroots campaigning on the Internet.

New Drupal 4.4 review

Harold Jarche and I have written a review of Drupal commissioned by Business Blog Consulting. The reviews Rick is posting seem to be getting a fair amount of traffic and interest. Hopefully lots more people will consider and adopt Drupal after reading this review.

Drupal.org upgraded

We just finished upgrading drupal.org to CVS HEAD, the code that will become Drupal 4.5.0. So in good Drupal tradition, we (developers) get to eat our own dog food again, and you (users) get to see and test the latest changes.

This marks a significant upgrade, yet most of the changes will only be visible to this site's administrators. We are still fine-tuning the site, but please let us know if you find anything out of whack.
