Counting character groups in a string? - Writing PHP

Training, Open Source computer languages

Perl • PHP • Python • MySQL • Apache / Tomcat • Tcl • Ruby • Java • C and C++ • Linux • CSS

For 2023 (and 2024 ...) - we are now fully retired from IT training.
We have made many, many friends over 25 years of teaching about Python, Tcl, Perl, PHP, Lua, Java, C and C++ - and MySQL, Linux and Solaris/SunOS too. Our training notes are now very much out of date, but due to upward compatability most of our examples remain operational and even relevant ad you are welcome to make us if them "as seen" and at your own risk.

Lisa and I (Graham) now live in what was our training centre in Melksham - happy to meet with former delegates here - but do check ahead before coming round. We are far from inactive - rather, enjoying the times that we are retired but still healthy enough in mind and body to be active!

I am also active in many other area and still look after a lot of web sites - you can find an index ((here))

Counting character groups in a string?

Posted by pgroves (pgroves), 24 October 2002

Hi - I need to process a large (1.2 Mb) text file, on each line there is a string that looks like this:

C01.252.200.500.550.800
C01.252.354.400
C01.252.410.890.328
etc.

For reasons I'm not going to go into here, I need to count the number of three charcter groups (seperated by a ".") for each line, and was wondering which was the most efficient way to do this? I can do this using explode, e.g:

Code:

$bits = explode(".",$data[1]);

Where $data is a line and $bits is the array of matches.
I can then count the number of 3 character groups by doing:

Code:

$level = count($bits);

Or is it more efficient to use a regular expression to match each three charcter group and then get the size of the returned array? In any case how would this be coded? I can't seem to get it right, here's my attempt:

Code:

ereg('[A-Z0-9]{3}$+',$data[1],$bits);

I'm obviously not doing this right, as Code:

count($bits)

is always 1, but I don't know how to do it correctly (still getting my head around regular expressions) - could someone help?

Also - how do you count the number of matches an in a regular expression? Is $count($bits) the right way?

cheers

Paul

Posted by admin (Graham Ellis), 24 October 2002

First thought .... if the characters are explicitly 3 character groups
between each "." as you seem to imply, why not simply write:
Code:

$ngroups = (count($data[1]) + 1 ) / 4;

There's also a function called substr_count that counts the number of
occurrences of one string in another, so
Code:

$nperiods = substr_count($data[1],".");

Now I confess I've never used that one myself, but it strikes me it's
pretty likely to be efficient.

Posted by pgroves (pgroves), 24 October 2002

on 10/24/02 at 14:02:39, Graham Ellis wrote:

First thought .... if the characters are explicitly 3 character groups
between each "." as you seem to imply, why not simply write:
Code:

$ngroups = (count($data[1]) + 1 ) / 4;

There's also a function called substr_count that counts the number of
occurrences of one string in another, so
Code:

$nperiods = substr_count($data[1],".");

Now I confess I've never used that one myself, but it strikes me it's
pretty likely to be efficient.

Out of interest I ran the different methods on our server and timed how long each one took (averaged over 4 goes), the results were:

Explode: 5.7 secs
Divide by 4: 5.6 secs
Substr: 5.2 secs

So there's not much in it really, though it possibly looks like Substr might be the quickest

BTW how *would* you count the number of 3 character matches using ereg?

cheers

Paul

Posted by admin (Graham Ellis), 24 October 2002

Within a regular expression, you use round brackets around groups you want to capture, otherwise you just get one string returned and that's the entire match - that's why you got a count of just 1.

Amazingly, although I'm a fan of regular expressions I'm going to discourage you from using them in this case; one of their weaknesses is that if you have a bracket with a count after it, only the LAST match to that bracket will be saved into the target match variable which would be a problem we would have to work around in your example. You would also be in some trouble if you have more that 9 groups, and ereg silently discards the 10th and subsequent matches ....

Summary, Rgeular expressions are great, but not for what you want to do

P.S. Timing differences may be more significant than you think; how long does it take to run your program and no nothing at all? I wonder how much of your 5.something seconds are consumed by reading the file rather than by the matching

Posted by pgroves (pgroves), 24 October 2002

on 10/24/02 at 14:39:22, Graham Ellis wrote:

Within a regular expression, you use round brackets around groups you want to capture, otherwise you just get one string returned and that's the entire match - that's why you got a count of just 1.

Amazingly, although I'm a fan of regular expressions I'm going to discourage you from using them in this case; one of their weaknesses is that if you have a bracket with a count after it, only the LAST match to that bracket will be saved into the target match variable which would be a problem we would have to work around in your example. You would also be in some trouble if you have more that 9 groups, and ereg silently discards the 10th and subsequent matches ....

Summary, Rgeular expressions are great, but not for what you want to do

P.S. Timing differences may be more significant than you think; how long does it take to run your program and no nothing at all? I wonder how much of your 5.something seconds are consumed by reading the file rather than by the matching

I tried running the program on just 200 lines of text, but it happens too quickly to notice *any* significant differences!

Paul

This page is a thread posted to the opentalk forum at www.opentalk.org.uk and archived here for reference. To jump to the archive index please follow this link.

You can Add a comment or ranking to this page

Public Training Courses

Running regularly at our UK training Centre.
[Schedule] - [About] - [Book]

Other Forum Posts

Sessions Problem

PHP Workshop for the hobbyist / club user

Uploading images/other binaries to MySQL via PHP

Data can't be inserted into Access database

Defensive OO Programing in PHP

ob_start() is inactive !!

Creatin fields in a table from an array

WHY CANT I PASS THE SIZE VARIABLE FROM Gorder.html

Array Push to skip first line

Retain the data on "Back Button " click of Browser

Saving image as base64

Using a redirect in a PHP mail form

reading image from mysql

PHP and Google Maps API

PHP - processing images as mySQL Blobs

How to grab the databse of other sites databse

Images from a database, Multiple images per page

php passing and assigning variables

php ignore html

I need to block users from creating an account...

date of the last added Link

xml and php problem

Security and safemode

Session question

Manipulation of pdf files thr php

PHP, Basic flat file shout box HELP

function to add HTML tags to text

preg_match to validate form

resize image before saving to mysql as blob

Sending mail with a order from a customer

any one help me to convert this code to php

Image and form upload

Inserting an image as a blob in PHP

Trouble with a Calendar Script

PHP and Regular Expressions

Regarding "last updated"

Uploading images to a website using PHP

Count links under a category

Simple Integration Method PHP trouble

Blob Images with Functions

Benchmarking code

Cusor postion in textcontrol with java script

Invalid argument supplied for foreach()

Seeking Freelance PHP/MySQL Coder

Character Encoding in PHP

URL variable passing proble:

Top ten downloads

premature end of script headers

String searching

HTTP_REFERER nothing there?

MySQL Upload Script

Writing RE in PHP 5

Extracting html form input elements

serve files from PHP

addslashes() & stripslashes() question

removing carriage returns/line feeds/line breaks

Looking for a freelance PHP/MySQL programmer

Quotes in Variable escaping Form Value

Notice: Undefined index: error in script

looking for ImageGif

Inserting data into a database

Telnet from PHP

how to make cursor work twice

Querying multiple table

using the correct tools?

PHP Website Hosting

Digging out embedded tags with preg_match_all()

Validate HTML offline on LAMP Dev Server

Splitting Strings

Assigning variables by value and reference

Concat and Declare variable in Oracleusing OCI

executing package not working

How to avoid Resource_ID Error ?

Oracle Package Bind Error

Resource Id Error

// indicating 'currentpage' state with PHP/CSS

Passing a URL variable

Warning: Cannot modify header information

open file, change content, save file

Does a particular URL exist?

outputing images from a php script,help.

Clearing history, cookies and cache files from PHP

Email Addresses for a particular company

Sendmail goes to-BUlk-folder

Wanted - include relative to include

Email Notification

Reading file contents

Looking 4 PHP function to load + display page.

Moved: Looking for a php professional

Using the header function

Help please! User Password  

ociplogon problem

Problem on user varification?

To Retrive clob datatype using oci funtions in php

How to Travese a xml string

regular expression help

PHP / MYSQL SEARCH ENGINE RANKINGS

Grab information from another Website

Regular Expression Problem

Trouble with MySQL command in script

Deleting a cookie

Objects, multithreaded web servers and persistance

summing a row in mysql

Random image in a web page via PHP?

Data truncation copying from file to mySQL table

My PHP tags aren't being interpretted

9 ways of sorting plus 2

TripleDES encryption

pack and unpack

PHP/SMTP Mail Problem

Webserver permissions and PHP

Help - input used to go to sendmail now to file

retrieving blobs with names

decoding entities in PHP

Which books to choose ?

public beta testing of PHP Editor

email with required fields

Arrays (?) and URLs

Using convert_uuencode to store images in MySQL DB

Help passing variables

php.ini under Mac OS X

hivalidation on submit

Valdiating Forms Using preg_match

Portability - questions for new programmers

PHP - GD - ImageMAP - Problems

Passing PHP Variables in a URL

is XSLT available in PHP?

Warning: Unknown():

Sending info to a new page

shopping list in PHP

math equation in php

php sql problem

History and background

Looking for a script in PHP

Html / PHP form into MySQL DB

Writing form field names to text file

Using a Form to write variables to a php page

Refreshing a page with PHP

Cloaked redirection

Redirect to another page

Script works online, but not locally??

Multi-dimensional arrays

Count query in MySQL

Resource id #3 - no data from simple queries

To use .gif, .png or .jpg?

Objects in PHP5

Spot the mistake??

create html table WITHOUT pear

create php table to input a  string Can you hel ?

Jazzing up PHP Emails!

addition of time

login script - tips/security

Moved: My root account is messed up - what can I do?

Cookies 101 with a twist

Adding spaces to a string

Uploading an image

Trying to refresh page

PHP & sendmail frustration

getdate () problem

program file extension .php4

Pulling data from a mySql table

Clearing/resetting a page

Code misses out when data has spaces

Quickie on preg_match

Link to site not working ??

php counter for web pages

really stupid question

select-option list

counting the number if params passed to function

Where is php.ini under Mac OSX

Deleting duplicates from an array

strip_tags stripping hyphen

Trouble pulling data using reg exp.

iffy if statement ?

math in php / mysql

Failing to write all data to file

deleting a file from within a php script

Sending PDF File to Browser via PHP

How long can a string be?

Warning stat failed for...

I need some input

DIRNAME (php 4.3.0)

User input and other checking

New PHP release

Which browser is being used?

Regular exressions in PHP

Searching a string for HTML tags

Using the back button in Netscape

Passing many SQL queries to one mysql_query

Removing last character of a string

Counting character groups in a string?

keyword searches

Retreiving a web page in PHP

PHP Form Processing Template

What is '&new'  ?

Making every page on a site into PHP

Session demo in a nutshell

Handling Comman Separated Variables in PHP

Incompatability in 4.1.2 over 4.0.x

Passing variables from page to page

Using booleans in PHP

Sending form variables to a text file

I cannot access MySQL from PHP

Passwords for MySQL; a conundrum??

PHP Independent of database - How?

Writing form variables to a file

Follow us on ...

© WELL HOUSE CONSULTANTS LTD., 2024: Well House Manor • 48 Spa Road • Melksham, Wiltshire • United Kingdom • SN12 7NY
PH: 01144 1225 708225 • FAX: 01144 1225 793803 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho