Home Accessibility Courses Twitter The Mouth Facebook Resources Site Map About Us Contact
Unique word locator - Python dict example

If a word occurs only once in my blog - all 4661 entries so far - chances are that it's a mis-spelling. And using a dict in Python, I can quickly parse a data stream with lots of text in it, isolate individual words, and see how many times each occurs.

Dictionaries are a very quick and easy way of looking up keys (they're used internally fo rvariable names in most scripting languages) so this runs really fast.

  import re
  word = re.compile(r'[A-Z]{2,}',re.I)
  wordcount = {}
  for line in open("blog"):
    words = word.findall(line)
    for item in words:
      i2 = item.lower()
      wordcount[i2] = wordcount.get(i2,0) + 1


I can then sort and output my answers. You can't sort a dict - but you can sort a list of keys

  used = wordcount.keys()
  used.sort(lambda y,x:wordcount[x]-wordcount[y])
  for item in used:
    print item, wordcount[item]


Sometimes, I mistype our dommain name "wellhousemanor" ... let's see

  WomanWithCat:f2916 grahamellis$ python uniwords | grep manor
  manor 1094
  wellhousemanor 547
  showmanor 5
  wellhousmanor 4
  theoldmanor 2
  manorsnow 2
  manordawn 2
  manorgarden 1
  wellhouesemanor 1
  wellhhousemanor 1
  manorside 1
  manorembossed 1
  manorcard 1
  manorgant 1
  greatchalfieldmanor 1
  manordaffs 1
  wellmousemanor 1
  WomanWithCat:f2916 grahamellis$



Well Mouse Manor ;-)

Complete source - [here]
(written 2016-03-06, updated 2016-03-08)

 
Associated topics are indexed as below, or enter http://melksh.am/nnnn for individual articles
Y107 - Python - Dictionaries
  [4668] Sorting a dict in Python - (2016-04-01)
  [4469] Sorting in Python 3 - and how it differs from Python 2 sorting - (2015-04-20)
  [4409] Setting up and using a dict in Python - simple first example - (2015-01-30)
  [4029] Exception, Lambda, Generator, Slice, Dict - examples in one Python program - (2013-03-04)
  [4027] Collections in Python - list tuple dict and string. - (2013-03-04)
  [3934] Multiple identical keys in a Python dict - yes, you can! - (2012-11-24)
  [3662] Finding all the unique lines in a file, using Python or Perl - (2012-03-20)
  [3555] Football league tables - under old and new point system. Python program. - (2011-12-18)
  [3554] Learning more about our web site - and learning how to learn about yours - (2011-12-17)
  [3488] Python sets and frozensets - what are they? - (2011-10-20)
  [3464] Passing optional and named parameters to python methods - (2011-10-04)
  [2994] Python - some common questions answered in code examples - (2010-10-10)
  [2986] Python dictionaries - reaching to new uses - (2010-10-05)
  [2915] Looking up a value by key - associative arrays / Hashes / Dictionaries - (2010-08-11)
  [2368] Python - fresh examples of all the fundamentals - (2009-08-20)
  [1145] Using a list of keys and a list of values to make a dictionary in Python - zip - (2007-04-13)
  [1144] Python dictionary for quick look ups - (2007-04-12)
  [955] Python collections - mutable and imutable - (2006-11-29)
  [103] Can't resist writing about Python - (2004-10-29)


Back to
What is happening on the 231 bus? What are you going to do about it?
Previous and next
or
Horse's mouth home
Forward to
Recursion in Python - the classic example
Some other Articles
Chippenham to Salisbury by public transport - what we have and what we could have
Mallory Place bus stop - services to Bath
Easy data to object mapping (csv and Python)
Recursion in Python - the classic example
Unique word locator - Python dict example
What is happening on the 231 bus? What are you going to do about it?
Prining a pound sign from Python AND running from the command line at the same time
The end of competition on a bus route - the effects from then end of the 234
Rumours of bus changes by First in Wiltshire - what we know and suspect
Identifying the first and last records in a sequence
4722 posts, page by page
Link to page ... 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95 at 50 posts per page


This is a page archived from The Horse's Mouth at http://www.wellho.net/horse/ - the diary and writings of Graham Ellis. Every attempt was made to provide current information at the time the page was written, but things do move forward in our business - new software releases, price changes, new techniques. Please check back via our main site for current courses, prices, versions, etc - any mention of a price in "The Horse's Mouth" cannot be taken as an offer to supply at that price.

Link to Ezine home page (for reading).
Link to Blogging home page (to add comments).

You can Add a comment or ranking to this page

© WELL HOUSE CONSULTANTS LTD., 2017: Well House Manor • 48 Spa Road • Melksham, Wiltshire • United Kingdom • SN12 7NY
PH: 01144 1225 708225 • FAX: 01144 1225 899360 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho

PAGE: http://www.wellho.net/mouth/4661_Uni ... ample.html • PAGE BUILT: Sat Jun 11 12:16:26 2016 • BUILD SYSTEM: WomanWithCat