University Web Developers

University Web Developers

Recently we have begun to experience an increase in spam generated from some of our web (HTML) forms. How do you deal with this? I'm concerned that some solutions may be inaccessible (I.e. CAPTCHA).

Tags: captcha, forms, spam

Share Twitter

Reply to This

Replies to This Discussion

nice.. I was not aware that this was available.

Reply to This

I got the idea from someone on uwebd. It's worth twice the price of admission!

Reply to This

You're all wrong, you need to apply the Land Before Time method!
The W3C has an interesting piece discussing this issue.

Reply to This

We are experimenting with the recaptcha API for some of our forms. The audio feature of it was quite appealing. It may not be a system wide solution, but to us it was worth a look.
http://recaptcha.net/learnmore.html

Reply to This

Another interesting method I have seen is to add
enctype="multipart/form-data"

to your form and check for that when processing. I guess most bots don't know to change their encode type. Granted this is another short fix till they decide to do that.

The other ones I know of are mostly Javascript based which I won't use on the off chance they don't have it enabled.

Reply to This

Reply to This

I recently tried a concept that seems to work well. It's based on the fact that most bots only visit a form page once, so that it can harvest the form and all its field names, action URL, etc. This info about the form is then stored in their database somehow, and is then used to constantly hammer the action URL by sending values for all the fields.

(Before continuing, let me say that I took a pretty detailed look at my server logs, found IP addresses of bots sending form spam, and discovered that those IP addresses were not actually visiting the form page itself. They were only submitting to the action URL.)

So, knowing this, I created a method that logs the IP address of every visit to a form. So when you visit a web page with a form on it, your IP address is logged, and you now have permission to actually submit the form. Then I changed my form handling program to check the incoming IP address against that log, to make sure that it has permission to submit the form.

In other words, you're not allowed to submit a form unless you actually visit the form web page first. Most bots do not visit the form web page first, and therefore most all form submissions are now coming from humans.

The main reason I like this solution is that it's all server-based. No javascript. No CAPTCHA. Nothing extra for users to do. And I got fantastic response from the form submission recipients. Form spam dropped to virtually zero!

Then came a problem. A number of visitors started reporting that they could not submit our forms. After investigating IP addresses, I found that they are all AOL users. It seems that AOL is doing some stuff with their IP address allocation, so that a user's IP address can actually change during their online session. This of course throws my whole theory out the window.

So, I currently have this whole thing turned off, and I'm hoping to figure out a way to still use it. I figured I'd post it here anyway, with the hope that maybe someone else can expand on it with a good idea.

Reply to This

Scott,

AOL uses proxy servers that make many users look like one IP address and, as you discovered, makes the same user look like different IP addresses sometimes.

You can check the referrer in the script for the action url to verify it came from the form page and achieve about the same amount of security without all the extra work; and avoid the AOL problem. Referrers can be spoofed, but so can IPs.

Reply to This

Most the bots I have dealt with are smart enough to spoof the referrer so that method hasn't been that fruitful for me.

Though Scott I think a better was to handle your situation is instead of putting an IP address in your DB, make a random seed and put that in the DB and place it in a hidden field in the form. After the person submits the form delete it from the database (to remove the chance you later generate the same seed and the person can't submit) and now when the bots attempt to keep doing it they won't be able to.

I think that method would fix the issues had with the previous solution.

Reply to This

Interesting, very interesting Richard. That seems to make sense. I'm definitely going to think about that a bit.

Reply to This

Yeah, when I did my initial look into the form spam in my server logs, I found a lot of spoofed HTTP_REFERERs. Their initial harvesting process just logged the URL of the original form and simply sent that with the form submissions. I know that IP addresses can be spoofed too, but after banging heads with a couple of other IT minds, we figured that using IP address is more reliable (less spoofed).

I also know about AOL's caching system (as well as the ability for any ISP to do the same). But I only thought that the risk would be multiple visitors with the same IP address. What I didn't realize is that an AOL user doesn't use the same cache server throughout the life of a single online session, hence the possibility of changing IP addresses.

I still like my original concept of insuring that a user actually visits the form first before submitting. I like a couple other ideas posted here too.

Reply to This

I haven't had a chance to try it yet, but the honeypot method seems to be working pretty well for people.

http://www.modernblue.com/web-design-blog/fighting-spam-with-css/

Or the simple question method:

"Please enter the letter a in the box at left."
http://www.subtraction.com

I've also seen, "Is ice cream hot or cold?"

Unfortunately, I don't think I have the ability to do any of this on the server site within our CMS.

Reply to This

RSS

Elsewhere

Latest Activity

Teresa Botelo, Maggie Bardacke, Matthew Fuerst and 1 more joined University Web Developers
6 minutes ago
I avoid US based servers for personal information, and try to keep all PII on-campus or at least on Canadian servers. The requesting IP number in GA data could be personally identifying in some cases, but I'm not very concerned. I work at a small c…
1 hour ago
We're happy to include Canadian schools in the directory. Your school will be included during our weekly update on Monday.
3 hours ago
How about Canadian schools? University of Toronto Mississauga - Prospective Students http://www.facebook.com/UTMProspective
7 hours ago
Diane Weller added a discussion
I’m involved in a project in which I’m researching how IT organizations practice the tracking of people’s work time; what are some best practices and how do organizations establish guidelines for tracking time. With the right approach, there can be…
7 hours ago
Gill joined Lynn Zawie's group
Share your experiences using OmniUpdate CMS
7 hours ago
Daniel Cerman Implementing a website redesign on Ektron CMS400.NET.
22 hours ago
Bob Tupaj added a discussion
Is anyone out there using Luminis for your school portal? If so, do you have a portal administrator, portal manager, etc. position as a point of contact (e.g., creating channels, providing training and support for the different applications, serving…
yesterday
Southeast Missouri State University on Facebook
yesterday
Was considering the template at INOVAS via the Dept of Higher Ed until i read this: http://www.simpleapply.com/blog/section-508-department-of-educations-net-price-calculator/
yesterday
Oh oh... it would be very helpful to know what % of mobile phones are Web accessible.. with a data plan. Based on that information, marketing to/with the device would change considerably.
yesterday
Thanks for these great ideas! More?
yesterday
Mardy Wilson, Laura Jane Travis, John Tuttle and 3 more joined University Web Developers
yesterday
From the Gift Store
on Wednesday
on Wednesday
daniel thornbury added 2 photos to the album 'Websites'
on Wednesday
A place to discuss marketing tactics for admissions or development on the web in higher education, whether it be features on your website, blogs, boards, chat, email marketing, social networks, etc...
on Wednesday
Hopefully we can get together!
on Wednesday
For designers, developers, and others who work at Law Schools.
on Wednesday
annual conference for higher education professionals who are involved in any step of the strategy, design or development of an institution's/department's website
on Wednesday

UWEBD has been in existence for more than 10 years and is the very best email discussion list on the Internet, in any industry, on any topic

About

© 2010   Created by Mark Greenfield

Badges  |  Report an Issue  |  Privacy  |  Terms of Service

Sign in to chat!