| |||||||||||
| |||||||||||
Where has Googlebot crawled?
this example from a Well House Consultants training course
Source code: googletrace Module: R050
# Where has Googlebot crawled? fh = File.new "../data/ac_20110201" gl = 0 counter = {} fh.readlines.each do |lyne| # Quickly eliminate lines that don't include "Googlebot" next unless lyne.match("Googlebot") gl += 1 # Collect the page that was trawled lyne =~ /"GET\s+([^\?\s]+)/ # Generate a has of counters counter[$1] = 0 unless counter[$1] counter[$1] += 1 end print "Googlebot has trawled #{gl} pages\n" # Sort the pages trawled by the number of times that each was read by Googlebot # And if the numbers are the same for two pages, sort them alphabetically pagestrawled = counter.keys.sort {|a, b| (n = counter[a] <=> counter[b]) == 0 ? a <=> b : n} # And output the results pagestrawled.each do |page| puts "#{counter[page]} - #{page}" end __END__ 92:februb grahamellis$ ruby googletrace Googlebot has trawled 8378 pages 1 - /YaBBImages/wink.gif 1 - /accom/BA3.html 1 - /accom/embackup.php4 1 - /adhoc/adhoc_sql_query_engine.php 1 - /alternate/perl-training-course_index.html 1 - /alternate/uk_index.html 1 - /archives/2006/09/add_to_shopping.html 1 - /archives/2006/09/making_pages_cl.html 1 - /archives/2006/09/morgans_hill.html 1 - /archives/2006/10/courses_at_well.html 1 - /archives/2006/10/open.html 1 - /archives/2006/12/index.html [etc] 15 - /demo/popup.php 17 - /melkshow/index.html 19 - /net/setcountry.php4 21 - /ask/index.php 21 - /demo/dropahead.php 24 - /net/quote.html 26 - /newsletter/ 29 - /demo/mqc.php 37 - /demo/newsletter.php 41 - /demo/mqchunks.php 59 - /robots.txt 146 - /demo/mqclim.php 152 - /net/images.php4 167 - /solutions/mouth.html 221 - /resources/smap.php 255 - /demo/ifvswit.php 344 - /demo/picclim.php 1689 - /resources/ex.php4 92:februb grahamellis$ Learn about this subject
This module and example are covered as required on private courses.
Should you wish to cover this example and associated subjects, and you're attending a public course
to cover other topics with us, please see our extra topic program.
Books covering this topic
Yes. We have over 700 books in our library. Books
covering Ruby are listed here and when you've selected a
relevant book we'll link you on to Amazon to order.
Other Examples
This example comes from our "this" training module. You'll find a description of the topic and some
other closely related examples on the "this" module index page.
Full description of the source code
You can learn more about this example on the training courses listed on this page,
on which you'll be given a full set of training notes.
Many other training modules are available for download (for limited use) from our download centre under an Open Training Notes License. Other resources
• Our Solutions centre provides a number of longer technical articles.
• Our Opentalk forum archive provides a question and answer centre. • The Horse's mouth provides a daily tip or thought. • Further resources are available via the resources centre. • All of these resources can be searched through through our search engine • And there's a global index here. Web site author
Purpose of this website
This is a sample program, class demonstration or answer from a
training course. It's main purpose
is to provide an after-course service to customers who have attended our
public private or
on site courses, but the examples are made
generally available under conditions described below.
Conditions of use
Past attendees on our training courses are welcome to use individual
examples in the course of their programming, but must check
the examples they use to ensure that they are suitable for their
job. Remember that some of our examples show you how not to do
things - check in your notes. Well House Consultants take no responsibility
for the suitability of these example programs to customer's needs.
This program is copyright Well House Consultants Ltd. You are forbidden from using it for running your own training courses without our prior written permission. See our page on courseware provision for more details. Any of our images within this code may NOT be reused on a public URL without our prior permission. For Bona Fide personal use, we will often grant you permission provided that you provide a link back. Commercial use on a website will incur a license fee for each image used - details on request. |
| ||||||||||
PH: 01144 1225 708225 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho PAGE: http://www.wellho.net/resources/ex.php • PAGE BUILT: Sun Oct 11 14:50:09 2020 • BUILD SYSTEM: JelliaJamb |