Home Accessibility Courses Twitter The Mouth Facebook Resources Site Map About Us Contact
 
Python and Tcl - public course schedule [here]
Private courses on your site - see [here]
Please ask about maintenance training for Perl, PHP, Lua, etc
 
Keeping forum and blog comments clean

We're all getting far too used to having to type in a word that's shown in an image, to answer a multiple choice question, to do a sum and type in the answer when we want to post to / comment on an article on a web site. And sometimes those images are quite hard to make out - indeed they seem designed to be the reverse of accessible!

Question ... "Why are the words at the bottom of the page so hard to decipher? Why are they needed at all? We are not on some Nationally sensitive site."

Everyone who runs a site that welcomes public comment needs to have some sort of protection and strategy against contributions by people who are known as "forum spammers". People who will contribute to a site, but off topic, with material at best dilutes the site and at worst causes real offence ... and they'll do it to advertise their own products. This web site you're reading at the moment has a peak traffic of over 250 visitors per hour in the middle of a weekday, and if an advertiser can sneak in his product (or, often, scam) onto a reputable site it will give it "street cred" and also help - by association - in search engine results - Search engines work along the lines of "it this is approved of by lots of reputable sites, then we should approve it more".

Does this effect even small new sites like our Melksham SCOB [Campus] site, where the question was asked? Yes - I don't think I'm giving anything away here - the very first comments were along the lines "What a fascinating site. Have you seen this probuct [link]". The obvious follow-up question is "Why not simply delete these contributions" ... but the answer is that they come too thick and fast; we have to have a mechanism that's prevention rather than cure.

There are two strategies to overcome forum spam. The first is to require all users to sign up, agree to terms and condtions, make some checks to be pretty sure that they're genuine, and then let them loose. This is what we use on a site that I look after as part of my campaign for an improved rail service for Melksham - see [here] for the registration page. It's excellent for a site where the operator anticipates regular contributions from the same people, where a continuity of submissions is useful, and where newcomers won't be too put off by the hurdles and intial wait to write their first contribution. The second is to check every post / contribution as it's made - yes, that involves repeated security checks that may be a bit irritating for the contributor - but it does get over that major hurdle of loosing a high proportion of potential contributiors because of sign up delays before they can even write anything.

That's given you an overview of why we need to protect against forum spam. The figures are huge; if you look at the Project Honeypot site you'll find figures in the millions, and if you look at the Stop Forum Spam site, you'll find that the whole front page is a list of spammers reported within the last minute!

Answering, now, the first part of the question. The words have to be hard to decipher to make it difficult for automated programs to do it - and character recognition is a very well developed science these days. If you can read it easily, then it's probable that a program can. And once you get programs generating spam, based on a pattern and sending it out to large lists of possible target sites, you're in a very interesting "game" indeed.
(written 2012-03-19, updated 2012-03-24)

 
Associated topics are indexed as below, or enter http://melksh.am/nnnn for individual articles
G909 - Well House Consultants - Spam, Spamming and Spammers
  [4520] No cold sales calls please - but delighted to hear from others! - (2015-09-29)
  [4315] Welcoming genuine forum posters quickly - but turning away off topic advertisers - (2014-11-16)
  [4135] Introducing your product to Well House Consultants - single, personally tuned email please - (2013-07-08)
  [3946] Moving from a warning system to a control system - PHP, forum spammers - (2012-12-07)
  [3912] Sand to Arabia, Coals to Newcastle or Woodburners to Russia - (2012-11-04)
  [3910] Identifying your real customers and keeping them well informed fast - (2012-11-02)
  [3506] Cold call contacts - preference services and turning off spam sales approaches - (2011-11-03)
  [3352] World Trade Register - Certainly NOT worth 2985 Euros. - (2011-07-09)
  [3316] Twitter Phishing Trips ... and a great new alert service - (2011-06-04)
  [3190] What do the following web sites have in common? - (2011-03-03)
  [3166] Well house is strong - confirmed? - (2011-02-11)
  [3016] The legal considerations of your web presence - revisited - (2010-10-26)
  [2884] Hotlinked images onto adult material sites - (2010-07-23)
  [2697] Email metrics and filtering - (2010-03-28)
  [2398] Websitemediasolution and a goldfish called Carl Johnson - (2009-09-06)
  [2276] Who is Marc Schneider of Multilingual Search Engine Optimization Inc - (2009-07-10)
  [2179] Offers that I can refuse - (2009-05-12)
  [2177] Preventing forum spam - checks at sign up - (2009-05-12)
  [2019] Baby Caleb and Fortune City in your web logs? - (2009-01-31)
  [1978] From spam to mod_alias - finding resources - (2009-01-05)
  [1817] Marc Schneider is still having email trouble - (2008-09-30)
  [1763] Co-operating to save, yet we dont - (2008-08-21)
  [1532] Comment spam blocked. Please comment via Forums - (2008-02-05)
  [1523] Ive just received an email from myself. Should I be worried? - (2008-01-29)
  [1115] Unexpected visitors to our site - (2007-03-22)
  [1037] Impact Engineering and Backscatter - (2007-01-16)
  [872] Email metrics - (2006-09-20)
  [495] More spam - a success story - (2005-11-13)
  [417] Telephone Preference Service - we're registered - (2005-08-17)
  [347] Frightening and from-friend viruses and spams - (2005-06-14)
  [338] OO techniques are hard to teach - (2005-06-06)
  [276] An apology to Mr Boneparte - (2005-04-11)
  [268] Information request forms, cleaning up spam - (2005-04-05)
  [259] Responding to spam - (2005-03-27)


Back to
A Pivotal Incident - learning how to welcome your guests
Previous and next
or
Horse's mouth home
Forward to
Finding all the unique lines in a file, using Python or Perl
Some other Articles
Will will smile?
Error checking in a Python program - making your program robust via exceptions
Changing shops and organisations - Melksham, the last and next five years
Finding all the unique lines in a file, using Python or Perl
Keeping forum and blog comments clean
A Pivotal Incident - learning how to welcome your guests
Welcome to Melksham - our new communities
Using Make for a distribution
Basham Festival, Melksham, early August 2012 - a welcome
TrainWest 2012 - 14th and 15th April, Melksham, Wiltshire
4759 posts, page by page
Link to page ... 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96 at 50 posts per page


This is a page archived from The Horse's Mouth at http://www.wellho.net/horse/ - the diary and writings of Graham Ellis. Every attempt was made to provide current information at the time the page was written, but things do move forward in our business - new software releases, price changes, new techniques. Please check back via our main site for current courses, prices, versions, etc - any mention of a price in "The Horse's Mouth" cannot be taken as an offer to supply at that price.

Link to Ezine home page (for reading).
Link to Blogging home page (to add comments).

You can Add a comment or ranking to this page

© WELL HOUSE CONSULTANTS LTD., 2019: 404 The Spa • Melksham, Wiltshire • United Kingdom • SN12 6QL
PH: 01225 708225 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho

PAGE: http://www.wellho.net/mouth/3661_Kee ... clean.html • PAGE BUILT: Sat May 27 16:49:10 2017 • BUILD SYSTEM: WomanWithCat