search engines and question marks

Training, Open Source computer languages

Perl • PHP • Python • MySQL • Apache / Tomcat • Tcl • Ruby • Java • C and C++ • Linux • CSS

Home

Accessibility

For 2023 (and 2024 ...) - we are now fully retired from IT training.
We have made many, many friends over 25 years of teaching about Python, Tcl, Perl, PHP, Lua, Java, C and C++ - and MySQL, Linux and Solaris/SunOS too. Our training notes are now very much out of date, but due to upward compatability most of our examples remain operational and even relevant ad you are welcome to make us if them "as seen" and at your own risk.

Lisa and I (Graham) now live in what was our training centre in Melksham - happy to meet with former delegates here - but do check ahead before coming round. We are far from inactive - rather, enjoying the times that we are retired but still healthy enough in mind and body to be active!

I am also active in many other area and still look after a lot of web sites - you can find an index ((here))

search engines and question marks

Posted by wilko (wilko), 27 November 2003

I have built a web site using Perl that is not being indexed by Google and I understand this is because it doesn't like my use of variables in URLs.

Before I look into rewrite engines and modifying my scripts I thought I'd share this with you to see if you have any pointers.

A typical url from my site:
http://www.trintech.com/index.cgi?n0=NAE&n1=213122241451005836515

I guess I would be looking to rewrite this to something like:
http://www.trintech.com/NAE/213122241451005836515.html

Any tips welcome.

Posted by Custard (Custard), 27 November 2003

Hi, not sure I'm going to be much use here, but..

How does the search engine find the url you have listed..

Ie. is there a HREF= link to that precise url.

as far as I understand, the crawler will look at links with HREFs and index them. If the url is the result of a form, then I would imagine it had little chance of getting there.

Also if the link is embedded in javascript it may not find it.

I think google etc will follow links into CGI scripts though. I have seen quite a lot indexed, but where the real page has now disappeared because it was dynamicaly generated at the time of the crawl.

HTH

B

Posted by admin (Graham Ellis), 27 November 2003

My understanding is that Google will visit pages with Get information in the URL (i.e. with a ?), but won't then follow any further links from those pages. This, apparently, is to avoid recursive indexing where a whole lot of URLs all really refer to the same page. Certainly there are may pages on www.wellho.net that are indexed including a get string and bring us many visitors.

Posted by wilko (wilko), 2 December 2003

The urls are all created on the fly but do remain consistent between visits so can be book-marked and SEs won't have broken links. One point raised that I have been choosing to ignore is dynamically generated links in menus. I am going to have ensure that the site can be fully navigated with JavaScript off, through use of a site map, just for the benefit of the crawler.

Then there is the issue of getting good rankings. It's a minefield and I guess this is why we have consultants who specialise in the area.

Thanks for your input.

This page is a thread posted to the opentalk forum at www.opentalk.org.uk and archived here for reference. To jump to the archive index please follow this link.

You can Add a comment or ranking to this page

Public Training Courses

Running regularly at our UK training Centre.
[Schedule] - [About] - [Book]

Other Forum Posts

[C] error returning array pointer from function

Need help in expect script(Reading a array)

ramkriz

removing periods with a shell script

File uid to Username in C++ using lstat

Reading files in C using system calls

Changing filenames in shell script

Array in Bourne Shell

Grep filename in shell script

Count number of files

C string funtion

Sprintf memory leak ?????????????????

date function in shell cmds

shell script

XML

vbscript on Windows

Looping within a shell script

Generating Smaller (filesize) Thumbnails

Unixtime Convertion

ENUM  nested within a struct defined within C++

expression parser

double comparisons within an if stat

Double type rounding

ITU codes

XML transformation application

Just try to solveit, if you can solve

Reading & analyzing of large files

number analysis algorithm

Opening a new window from a form

Can anyone help me with favicon?

Help us to test new CSS Editor

XSL entity file

County and State list

Verify file ownership in unix shell script

English check! An HTML file?

i want to learn lisp

Great Oak trees from little acorns grow

Setting up WebDAV on Apache

Any ideas - tags added to an email reply.

JavaScript - change selected text using button

apache re-write directories problem

MIB

Httpd to Tomcat, Proxy, code 302 problems

Javascript - resizing image to fit browser window

missing and corrupt files since cron backup script

crontab beginner

Which versions Mysql/Apache?

How do you test/debug PHP/Apache installation

search engines and question marks

Tomcat server shut down

wget - grabbing web pages from command line

procmailrc

how do I change hyperlink colours in HTML

calling system() in cgi

How do I get my site on the search engines?

Regular expressions in Fortran?

SVG?

Javascript

EXTENDED USE of this area

Running binary programs from a web site

What does "awk" stand for?

© WELL HOUSE CONSULTANTS LTD., 2024: Well House Manor • 48 Spa Road • Melksham, Wiltshire • United Kingdom • SN12 7NY
PH: 01144 1225 708225 • FAX: 01144 1225 793803 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho