For 2023 (and 2024 ...) - we are now fully retired from IT training.
We have made many, many friends over 25 years of teaching about Python, Tcl, Perl, PHP, Lua, Java, C and C++ - and MySQL, Linux and Solaris/SunOS too. Our training notes are now very much out of date, but due to upward compatibility most of our examples remain operational and even relevant, and you are welcome to make use of them "as seen" and at your own risk.

Lisa and I (Graham) now live in what was our training centre in Melksham - happy to meet with former delegates here - but do check ahead before coming round. We are far from inactive - rather, enjoying the times that we are retired but still healthy enough in mind and body to be active!

I am also active in many other areas and still look after a lot of web sites - you can find an index ((here))

Unique word locator - Python dict example

If a word occurs only once in my blog - all 4661 entries so far - chances are that it's a mis-spelling. And using a dict in Python, I can quickly parse a data stream with lots of text in it, isolate individual words, and see how many times each occurs.

Dictionaries are a very quick and easy way of looking up keys (they're used internally for variable names in most scripting languages) so this runs really fast.

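  import re
  # A word is a run of two or more letters; re.I makes the match case-insensitive
  word = re.compile(r'[A-Z]{2,}',re.I)
  wordcount = {}
  with open("blog") as source:
    for line in source:
      words = word.findall(line)
      for item in words:
        # Fold to lower case so "Manor" and "manor" share one count
        i2 = item.lower()
        # get() supplies 0 the first time a word is seen
        wordcount[i2] = wordcount.get(i2,0) + 1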
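
The same get-and-increment pattern is also available ready-made as collections.Counter in the standard library. What follows is a minimal sketch rather than the original script: the names count_words and sample are made up for illustration, and the input is a short in-memory list instead of the blog file.

```python
import re
from collections import Counter

# Same pattern as in the post: runs of two or more letters, any case
WORD = re.compile(r'[A-Z]{2,}', re.I)

def count_words(lines):
    """Count lower-cased words across an iterable of text lines."""
    counts = Counter()
    for line in lines:
        # update() adds one to the count of every word it is given
        counts.update(w.lower() for w in WORD.findall(line))
    return counts

# Illustrative input only
sample = ["Well House Manor is in Melksham",
          "the manor garden at Well House"]
counts = count_words(sample)
print(counts["manor"], counts["melksham"], counts["missing"])   # 2 1 0
```

A Counter returns 0 for a word it has never seen, so there is no need for get() with a default when reading the counts back.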


I can then sort and output my answers. You can't sort a dict - but you can sort a list of its keys.

  # Build a list of the keys, ordered by descending count (Python 3 syntax)
  used = sorted(wordcount, key=wordcount.get, reverse=True)
  for item in used:
    print(item, wordcount[item])
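
Since the starting idea was that a word seen only once is probably a mis-spelling, it may help to pull out just those words. This is a rough sketch, not part of the original program: singletons is a hypothetical helper name, and the few counts are hard-coded here purely for illustration.

```python
# A handful of counts hard-coded for illustration
wordcount = {
    "manor": 1094,
    "wellhousemanor": 547,
    "wellhousmanor": 4,
    "wellmousemanor": 1,
    "manorside": 1,
}

def singletons(counts):
    """Return the words seen exactly once, in alphabetical order."""
    return sorted(word for word, n in counts.items() if n == 1)

for word in singletons(wordcount):
    print(word)
```

Not every singleton is a typo (a place name may legitimately appear once), so the list is a set of candidates to check by eye rather than a verdict.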


Sometimes, I mistype our domain name "wellhousemanor" ... let's see

  WomanWithCat:f2916 grahamellis$ python uniwords | grep manor
  manor 1094
  wellhousemanor 547
  showmanor 5
  wellhousmanor 4
  theoldmanor 2
  manorsnow 2
  manordawn 2
  manorgarden 1
  wellhouesemanor 1
  wellhhousemanor 1
  manorside 1
  manorembossed 1
  manorcard 1
  manorgant 1
  greatchalfieldmanor 1
  manordaffs 1
  wellmousemanor 1
  WomanWithCat:f2916 grahamellis$



Well Mouse Manor ;-)
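
The grep step in that command line could equally be done inside Python. The sketch below assumes a dict of counts like the one built earlier; matching is a hypothetical helper name, and breaking ties alphabetically is my own choice rather than anything the original script does.

```python
def matching(counts, fragment):
    """Return (word, count) pairs whose word contains fragment,
    highest count first, ties in alphabetical order."""
    hits = [(word, n) for word, n in counts.items() if fragment in word]
    return sorted(hits, key=lambda pair: (-pair[1], pair[0]))

# Three counts from the run above, plus one made-up entry ("python")
# that should be filtered out
wordcount = {"manor": 1094, "wellhousemanor": 547,
             "python": 3, "wellmousemanor": 1}

for word, n in matching(wordcount, "manor"):
    print(word, n)
```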

Complete source - [here]
(written 2016-03-06, updated 2016-03-08)

 
Associated topics are indexed as below, or enter http://melksh.am/nnnn for individual articles
Y107 - Python - Dictionaries
  [103] Can't resist writing about Python - (2004-10-29)
  [955] Python collections - mutable and imutable - (2006-11-29)
  [1144] Python dictionary for quick look ups - (2007-04-12)
  [1145] Using a list of keys and a list of values to make a dictionary in Python - zip - (2007-04-13)
  [2368] Python - fresh examples of all the fundamentals - (2009-08-20)
  [2915] Looking up a value by key - associative arrays / Hashes / Dictionaries - (2010-08-11)
  [2986] Python dictionaries - reaching to new uses - (2010-10-05)
  [2994] Python - some common questions answered in code examples - (2010-10-10)
  [3464] Passing optional and named parameters to python methods - (2011-10-04)
  [3488] Python sets and frozensets - what are they? - (2011-10-20)
  [3554] Learning more about our web site - and learning how to learn about yours - (2011-12-17)
  [3555] Football league tables - under old and new point system. Python program. - (2011-12-18)
  [3662] Finding all the unique lines in a file, using Python or Perl - (2012-03-20)
  [3934] Multiple identical keys in a Python dict - yes, you can! - (2012-11-24)
  [4027] Collections in Python - list tuple dict and string. - (2013-03-04)
  [4029] Exception, Lambda, Generator, Slice, Dict - examples in one Python program - (2013-03-04)
  [4409] Setting up and using a dict in Python - simple first example - (2015-01-30)
  [4469] Sorting in Python 3 - and how it differs from Python 2 sorting - (2015-04-20)
  [4668] Sorting a dict in Python - (2016-04-01)




This is a page archived from The Horse's Mouth at http://www.wellho.net/horse/ - the diary and writings of Graham Ellis. Every attempt was made to provide current information at the time the page was written, but things do move forward in our business - new software releases, price changes, new techniques. Please check back via our main site for current courses, prices, versions, etc - any mention of a price in "The Horse's Mouth" cannot be taken as an offer to supply at that price.


© WELL HOUSE CONSULTANTS LTD., 2024: 48 Spa Road • Melksham, Wiltshire • United Kingdom • SN12 7NY
PH: 01144 1225 708225 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho

PAGE: http://www.wellho.net/mouth/4661_.html • PAGE BUILT: Sun Oct 11 16:07:41 2020 • BUILD SYSTEM: JelliaJamb