Matching disparate referencing systems (MediaWiki, PHP, also Tcl)

Yes, we are Well House CONSULTANTS and do a bit of specialist coding ...

I currently have a requirement to write a piece of code for a customer that recognises cross-reference codes within a document and turns them into links. What makes the task quite difficult is that the references come from all sorts of different original sources, in varied formats, some of which might even be identifiable in two ways.

We'll be using a regular expression based identification system, but how do we make such a scheme logical, easy to follow, and easy to maintain in the future as new references and exceptions to the general rules get added? Well, to start with, I'll be using the bunching technique I described last week to make individual regular expressions easier to read, and to avoid the need to keep repeating the same bundles of special characters. But there will be more to it ...
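As a rough illustration of the bunching idea (a sketch only, written in Python for brevity rather than in the language of the real job), commonly repeated fragments can be given names once and then dropped into each full expression. The fragment names and the "REF-2009/123" and "INV/77-2008" formats are invented for the example; they are not the customer's real reference formats.

```python
import re

# Named fragments, written once (hypothetical formats, for illustration only)
YEAR = r"(?:19|20)\d{2}"   # a four-digit year in the 1900s or 2000s
SERIAL = r"\d{1,5}"        # a serial number of one to five digits

# Full expressions are assembled from the fragments, so each one stays short
# and a change to what counts as a "year" is made in a single place.
REPORT_REF = re.compile(rf"\bREF-(?P<year>{YEAR})/(?P<serial>{SERIAL})\b")
INVOICE_REF = re.compile(rf"\bINV/(?P<serial>{SERIAL})-(?P<year>{YEAR})\b")
```

Each compiled expression exposes its pieces as named groups, so a match on "REF-2009/123" gives a year and a serial without counting brackets.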

Spring, Summer, Autumn

Most of the cross reference codes will conform to a pattern, or a series of patterns, which can be identified fairly easily. I'll describe these as "summer" expressions, as that's the time of year that most people go on holiday, that places are crowded, and there's a maximum of facilities available for them.

For those who don't manage to catch the summer, there are autumn holidays - fewer people around, and special cases for those who have missed out on the summer. I'm going to describe a series of autumn matches for those references which have been missed by the main filters.

Some of the URLs that form special references include an embedded main (summer) reference in them ... so handling them can't wait until the autumn, as by then the summer pass would already have picked out the embedded reference on its own. So for this reason, we'll also provide early-bird spring holidays (or regular expressions) to ensure that it's the proper complete reference that's handled, rather than the embedded mainstream one.

And finally ... I understand there are special cases. We'll call those "snowdrops" - we'll allow them to be individually marked up within documents by the document provider, and they'll be extracted / handled ahead of spring.
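The ordering described above (snowdrops, then spring, then summer, then autumn) might be sketched as follows. This is a minimal illustration in Python, not the code written for the customer: the `{{ref:...|...}}` markup, the "AB-1234" and "old ref 42" formats and the example.org URLs are all assumptions made up for the example. Each finished link is parked behind a placeholder so that a later season cannot match text an earlier season has already dealt with. The input is assumed to be plain text; only the generated links are HTML-escaped.

```python
import re
from html import escape

# Every pattern and URL here is hypothetical, chosen only to show the ordering.
SUMMER = r"[A-Z]{2}-\d{4}"   # the mainstream reference format

def _snowdrop(m):   # marked up by hand by the document provider
    return m.group("url"), m.group("label")

def _spring(m):     # a whole URL that happens to contain a summer reference
    return m.group(0), m.group(0)

def _summer(m):     # the mainstream case
    return "https://example.org/ref/" + m.group(0), m.group(0)

def _autumn(m):     # leftovers the main filter missed
    return "https://example.org/legacy/" + m.group("num"), m.group(0)

# Order matters: earlier seasons get first pick of the text.
PASSES = [
    ("snowdrop", re.compile(r"\{\{ref:(?P<url>[^|}]+)\|(?P<label>[^}]+)\}\}"), _snowdrop),
    ("spring", re.compile(r"https?://[^\s\x00]*" + SUMMER + r"[^\s\x00]*"), _spring),
    ("summer", re.compile(r"\b" + SUMMER + r"\b"), _summer),
    ("autumn", re.compile(r"\bold ref (?P<num>\d+)\b"), _autumn),
]

def link_references(text):
    """Turn recognised references into links, one season at a time."""
    stash = []  # finished links, parked where later passes cannot see them

    def park(handler):
        def replace(m):
            url, label = handler(m)
            stash.append('<a href="%s">%s</a>' % (escape(url, quote=True), escape(label)))
            return "\x00%d\x00" % (len(stash) - 1)   # placeholder token
        return replace

    for _season, pattern, handler in PASSES:
        text = pattern.sub(park(handler), text)

    # Put the parked links back in place of their placeholders.
    return re.sub(r"\x00(\d+)\x00", lambda m: stash[int(m.group(1))], text)
```

With this arrangement a URL such as http://example.org/docs/AB-1234/summary becomes a single link in the spring pass, and the AB-1234 inside it is never offered to the summer pass. A new exception is added by putting one more entry into `PASSES` at the appropriate season.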

A new idea? No - there's nothing much new in this world ... you'll see a similar concept used within expect, with the expect, expect_before and expect_after commands. "Look out for xxx, failing that yyy, failing that zzz". Tcl may be mature but it's still an inspiration!
(written 2009-05-19 09:08:12)

 
Associated topics are indexed under
Q110 - Object Orientation and General technical topics - Programming Algorithms
  [2617] Comparing floating point numbers - a word of caution and a solution - (2010-02-01)
  [2586] And and Or illustrated by locks - (2010-01-17)
  [2509] A life lesson from the accuracy of numbers in Excel and Lua - (2009-11-21)
  [2259] Grouping rows for a summary report - MySQL and PHP - (2009-06-27)
  [1949] Nuclear Physics comes to our web site - (2008-12-17)
  [1840] Validating Credit Card Numbers - (2008-10-14)
  [1391] Ordnance Survey Grid Reference to Latitude / Longitude - (2007-10-14)
  [1187] Updating a page strictly every minute (PHP, Perl) - (2007-05-14)
  [1157] Speed Networking - a great evening and how we arranged it - (2007-04-21)
  [642] How similar are two words - (2006-03-11)
  [227] Bellringing and Programming and Objects and Perl - (2005-02-25)
  [202] Searching for numbers - (2005-02-04)




This is a page archived from The Horse's Mouth at http://www.wellho.net/horse/ - the diary and writings of Graham Ellis. Every attempt was made to provide current information at the time the page was written, but things do move forward in our business - new software releases, price changes, new techniques. Please check back via our main site for current courses, prices, versions, etc - any mention of a price in "The Horse's Mouth" cannot be taken as an offer to supply at that price.


© WELL HOUSE CONSULTANTS LTD., 2010: Well House Manor • 48 Spa Road • Melksham, Wiltshire • United Kingdom • SN12 7NY
PH: 01144 1225 708225 • FAX: 01144 1225 344596 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho