Home Accessibility Courses Diary The Mouth Forum Resources Site Map About Us Contact
 
Python and Tcl - public course schedule [here]
Private courses on your site - see [here]
Please ask about maintenance training for Perl, PHP, Lua, etc
 
Web Page and HTML Spell Checker

Our web site consists of some 7000 different URLs - we aim to add one or two new pages every day; they may be technical tips on the blog, answers to questions on our forum, or just general information pages. And it's so easy to make typing mistakes - especially when there's no spell checker in the application that we're using!

This solution centre page provides you with the links to a web page spell checker that was written initially as a "test of concept" but has proven to be rather useful to us - it's certainly allowed me to find a few rather embarrassing typos that have been on our site for quite a while.

HOW DOES IT WORK?

The application reads in a complete file of HTML, a block of HTML from a textarea, or even a full URL (yes, you can use it to test a live web site too).

We use the PHP striptags function to remove all the tags (with extra code to remove Javascript too) and we then break up the page into a series of tokens - words (or wannabe words), punctuation and spaces.

The dictionary that's supplied with Linux and Unix operating systems is read into an associative array, and each incoming word from the HTML is the checked against that associative array (the keys are the words - done for efficiency). An extra file of our own acceptable words has been added too - the original dictionary is now a bit old and non-technical.

Many words have derivates - big, bigger, biggest for example - and if the initial match fails there's a series of derivative checks made too.

CAN I USE THE SPELLCHECKER?

You are very welcome to run a few pages through the spell checker on this site - but please be mindful of the resources and bandwidth you will be using; we've added an unpublished "cap" into the live script.

>> Run the spell checker
You are welcome to adopt and adapt our source code for use on your own web site, but we do ask you to refer other people back to this page if they want to use it too. You may NOT ask for payment from anyone for use of this script, you may NOT charge for implementing on anyone else's site, and you may NOT republish it.

>> View the source code
ENHANCEMENTS

This script could be enhanced to add facilities such as:

1. Suggestions of corrected spellings

2. Your own private dictionary

3. Logging in, authorisation, charging and accounting facilities

If you would like to learn PHP so that you can add these facilities, or if you would like to enquire about specific tailored training and help installing the code, please contact us.

See also Run the spell checker

Please note that articles in this section of our web site were current and correct to the best of our ability when published, but by the nature of our business may go out of date quite quickly. The quoting of a price, contract term or any other information in this area of our website is NOT an offer to supply now on those terms - please check back via our main web site

Related Material

String Handling in PHP
  [4072] Splitting the difference with PHP - (2013-04-27)
  [4071] Setting up strings in PHP - (2013-04-27)
  [3790] Solution looking for a problem? Lookahead and Lookbehind - (2012-06-30)
  [3789] More than just matching with a regular expression in PHP - (2012-06-30)
  [3788] Getting more than a yes / no answer from a regular expression pattern match - (2012-06-30)
  [3534] Learning to program in PHP - Regular Expression and Associative Array examples - (2011-12-01)
  [3516] Regular Expression modifiers in PHP - summary table - (2011-11-12)
  [3515] PHP - moving from ereg to preg for regular expressions - (2011-11-11)
  [3424] Divide 10000 by 17. Do you get 588.235294117647, 588.24 or 588? - Ruby and PHP - (2011-09-08)
  [3020] Handling (expanding) tabs in PHP - (2010-10-29)
  [2629] Curly braces within double quoted strings in PHP - (2010-02-09)
  [2238] Handling nasty characters - Perl, PHP, Python, Tcl, Lua - (2009-06-14)
  [2165] Making Regular Expressions easy to read and maintain - (2009-05-10)
  [2046] Finding variations on a surname - (2009-02-17)
  [1799] Regular Expressions in PHP - (2008-09-16)
  [1613] Regular expression for 6 digits OR 25 digits - (2008-04-16)
  [1603] Do not SHOUT and do not whisper - (2008-04-06)
  [1533] Short and sweet and sticky - PHP form input - (2008-02-06)
  [1372] A taster PHP expression ... - (2007-09-30)
  [1336] Ignore case in Regular Expression - (2007-09-08)
  [1195] Regular Express Primer - (2007-05-20)
  [1058] PHP Regular expression to extrtact link and text - (2007-01-31)
  [1008] Date conversion - PHP - (2006-12-26)
  [728] Looking ahead and behind in a Regular Expression - (2006-05-22)
  [716] Evaluating arithmetic expressions in configuration files - (2006-05-10)
  [642] How similar are two words - (2006-03-11)
  [608] Don't expose your regular expressions - (2006-02-15)
  [589] Robust PHP user inputs - (2006-02-03)
  [574] PHP - dividing a string up into pieces - (2006-01-23)
  [560] The fencepost problem - (2006-01-10)
  [558] Converting between acres and hectares - (2006-01-08)
  [493] Running a Perl script within a PHP page - (2005-11-12)
  [463] Splitting the difference - (2005-10-13)
  [422] PHP Magic Quotes - (2005-08-22)
  [337] the array returned by preg_match_all - (2005-06-06)
  [54] PHP and natural sorting - (2004-09-19)
  [31] Here documents - (2004-08-28)

resource index - PHP
Solutions centre home page

You'll find shorter technical items at The Horse's Mouth and delegate's questions answered at the Opentalk forum.

At Well House Consultants, we provide training courses on subjects such as Ruby, Lua, Perl, Python, Linux, C, C++, Tcl/Tk, Tomcat, PHP and MySQL. We're asked (and answer) many questions, and answers to those which are of general interest are published in this area of our site.

You can Add a comment or ranking to this page

© WELL HOUSE CONSULTANTS LTD., 2019: Well House Manor • 48 Spa Road • Melksham, Wiltshire • United Kingdom • SN12 7NY
PH: 01225 708225 • FAX: 01225 793803 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho

PAGE: http://www.wellho.net/solutions/php-web- ... ecker.html • PAGE BUILT: Wed Mar 28 07:47:11 2012 • BUILD SYSTEM: wizard