Update on the Spam Front

So, a few weeks ago, I was getting fed up with the comment spam onslaught, and tried using captcha technology to shut it down. And it worked, which was great, but it also caused some problems for some readers, which was not so great. Pissing off readers is not something I want to do, so I dropped the captcha and am trying some new back-end spam filters instead.

Although the new spam filters seem to be working well so far, I’m a bit bummed about the outcome, because the captcha plug-in I was using — reCaptcha — is a bit more than your average spam deflector:

ReCaptcha is a rather clever service using them to help digitize books scanned into the Internet Archive as well. It’s a project from the School of Computer Science at Carnegie Mellon.

The Internet Archive is home to over 200,000 scanned copies of classic books. Some of them are gorgeously crafted, like this children’s book, but fancy styling can make it difficult for computers to translate the books into an indexable digital text. Much like a Mechanical Turk application, ReCaptcha uses humans to translate images of scanned words that a computer couldn’t understand.

I’d like to implement ReCaptcha at work instead of the captcha currently in place on the blog there, but it’s busy as hell right now; I probably won’t be able to get to it for a few more weeks.

Against The Grain

I’m of the camp that feels a blog is not a blog without comments. I also feel that there should be as few barriers to entry as possible when it comes to allowing comments. Easy interaction is a huge part of what has made the Internet great, and I’m a huge fan of it.

However, the damned comment spammers have made it impossible for me to run my blog the way I’d prefer. The vast majority of my older blog entries have had their comments closed to keep the spammers at bay, I’ve added all sorts of anti-spam plug-ins, and I’ve turned trackbacks off entirely.

And despite it all, especially in the last 48 hours, the comment spam has been overwhelming.

So today, and unwillingly, Captcha verification has been added to the blog comment form (Thanks Scott!). I really wish I didn’t have to, but it’s either that or go to some form of user registration / comment moderation, and on balance, I think Captcha is a little less of a burden all around.

To all you comment spammers out there who’ve forced me into this: You suck.

To the rest of you: I’m sorry. I hope it’s not too much of a hassle.