Information request forms, cleaning up spam
We've been discovered! Or rather ... our brochure request form has been discovered, just like the comment submission form to this diary has been discovered, by "spam engines".
These "spam engines" locate web forms, then complete them with information about online gaming, pharmaceutical products, and other goods and services that we're not interested in. They're characterised by including a very high proportion of links - especially in text areas. I believe that they're hoping to find forms that will let them post information onto bulletin boards and other web sites.

How to deal with this nuisance? I've amended our information request form response script to compare the length of the text as entered ("raw") with its length once opening "a href" tags are stripped out ... and if it shrinks by nearly a third or more (a raw-to-stripped ratio above 1.4), it's probably spam. It's hard to be sure, so I'm now in a testing phase that simply marks the emails sent by the brochure request system.

Code (in Perl) to accumulate the full and stripped lengths - run on each field of the form:

    $full_length += length($value);
    $value =~ s/<a\s+href[^>]+>/ /ig;
    $stripped_length += length($value);

Code that evaluates whether or not the posting is spam:

    $spamfactor = $full_length / $stripped_length;
    if ($spamfactor > 1.4) {
        $extraword = "SPAM";
    } else {
        $extraword = "OK";
    }

Note that I have also initialised the $full_length and $stripped_length variables to 1, not 0, so that a blank form submitted by anyone (or any automaton) cannot cause a division by zero.

(written 2005-04-05 06:32:03)
Associated topics are indexed under G902 - Well House Consultants - Web site techniques, utility and visibility
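The two fragments above can be pulled together into one small, self-contained sketch. This is only an illustration and not the actual brochure request script: the subroutine name classify_submission, the optional threshold argument, and the sample form data are all assumptions made up for the example. The regular expression, the 1.4 cut-off, and the starting value of 1 for both counters are taken from the post.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not part of the original script): label a set of
# form fields "SPAM" or "OK" according to how much the text shrinks when
# each opening <a href=...> tag is replaced by a single space.
sub classify_submission {
    my ($fields, $threshold) = @_;
    $threshold = 1.4 unless defined $threshold;    # cut-off used in the post

    # Start both counters at 1, not 0, so a blank form cannot divide by zero.
    my ($full_length, $stripped_length) = (1, 1);

    for my $value (values %$fields) {
        my $text = defined $value ? $value : '';
        $full_length += length($text);

        # Work on a copy so the caller's data is left untouched.
        (my $stripped = $text) =~ s/<a\s+href[^>]+>/ /ig;
        $stripped_length += length($stripped);
    }

    my $spamfactor = $full_length / $stripped_length;
    return $spamfactor > $threshold ? "SPAM" : "OK";
}

# Made-up sample submissions, for illustration only.
my %genuine = (
    name    => 'A. Customer',
    comment => 'Please send me your Perl course brochure.',
);
my %linky = (
    name    => 'x',
    comment => join(' ', ('<a href="http://example.com/pills">buy</a>') x 5),
);

print classify_submission(\%genuine), "\n";    # prints OK
print classify_submission(\%linky),   "\n";    # prints SPAM
print classify_submission({}),        "\n";    # blank form: prints OK
```

Passing the threshold as an argument makes it easy to experiment during a testing phase like the one described above, for example trying 1.5 (text shrinking by exactly a third) against 1.4.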
This is a page archived from The Horse's Mouth at http://www.wellho.net/horse/ - the diary and writings of Graham Ellis. Every attempt was made to provide current information at the time the page was written, but things do move forward in our business - new software releases, price changes, new techniques. Please check back via our main site for current courses, prices, versions, etc - any mention of a price in "The Horse's Mouth" cannot be taken as an offer to supply at that price.
PH: 01144 1225 708225 • FAX: 01144 1225 707126 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho