Huge files in Python - over 4 Gbytes - Programming in Python and Ruby

Training, Open Source computer languages

Perl • PHP • Python • MySQL • Apache / Tomcat • Tcl • Ruby • Java • C and C++ • Linux • CSS

Home

Accessibility

For 2023 (and 2024 ...) - we are now fully retired from IT training.
We have made many, many friends over 25 years of teaching about Python, Tcl, Perl, PHP, Lua, Java, C and C++ - and MySQL, Linux and Solaris/SunOS too. Our training notes are now very much out of date, but due to upward compatability most of our examples remain operational and even relevant ad you are welcome to make us if them "as seen" and at your own risk.

Lisa and I (Graham) now live in what was our training centre in Melksham - happy to meet with former delegates here - but do check ahead before coming round. We are far from inactive - rather, enjoying the times that we are retired but still healthy enough in mind and body to be active!

I am also active in many other area and still look after a lot of web sites - you can find an index ((here))

Huge files in Python - over 4 Gbytes

Posted by admin (Graham Ellis), 19 August 2004

Looking back just a few years, a file in excess of 4 Gbytes was unthinkable and files (or even) file systems were limited to 2^32 (2 to the power 32) bytes. These days, though, a file in excess of 4 Gb is perfectly possible on most (but not all) file systems and can be handled by most (but not all) languages.

In the last couple of days, I was asked about huge files in Python - rumours of problems were reported - and I wrote the following and tested it just fine to extract every millionth line from a 6.9 Gb file.

Code:

#/usr/bin/python

huge = open("huge.txt")
count = 0

for line in huge.xreadlines():
count += 1
if not (count % 1000000):
print str(count)+" "+line

Note - any construct that reads the whole of the file into memory at one do is going to fail ... that's why I chose xreadlines.

This page is a thread posted to the opentalk forum at www.opentalk.org.uk and archived here for reference. To jump to the archive index please follow this link.

You can Add a comment or ranking to this page

Public Training Courses

Running regularly at our UK training Centre.
[Schedule] - [About] - [Book]

Other Forum Posts

Python key error

Formatting SQL output

Awkward looking code

Variable scope in Python

Exception Handling

PYTHON on LINUX

python with tomcat on windows

Formatting

In Mysql, longblob is not storing more than 200kb

keyboard and mouse event

Python Imaging Library (PIL)

not able to access accessor function generated by

Sound control

could any one change this code to php

Zope and Plone Training - with or without Python

Unicode support in Ruby

Regular expressions - findall or split?

The future of Python

installation of rails off the rails!

Should I use from or import?

Game Programming with Python trouble

Win32API and GetFileVersionInfo

object relationships

Unable to put the plugins of Python in Eclipse3.1

Ruby: OpenGL | GNUT

python and pyqt

Jython:replace value with another value

Exception while running jython script

Jython:How  to replace parameters in URL path?

Python Challenge

Common Python questions

fancy code reviewing some python ?

what is the difference between range & xrange (py)

doubt in python program

Examples

Everything is an object

What can Phyton Really do?

Building sites with Zope and Python

Do you do Jython training?

Huge files in Python - over 4 Gbytes

hy "print >>" is wrong

CGI tables in Ruby

PyQt - Example

Python - Sort Order

Sorting a dictionary in Python

shuffling

Python and SQL / the DB API and Gadfly

Running shell commands from Python

Where do add an exception block

How far from your Python?

Printing out currencies in Python

Python comes to Opentalk! (RG25, 25th December 03)

New release of Ruby (SN12 20040103)

A Rosette for Ruby (NN10, 20030910)

What is Ruby? (LL57, 7th Oct 2003)

© WELL HOUSE CONSULTANTS LTD., 2024: Well House Manor • 48 Spa Road • Melksham, Wiltshire • United Kingdom • SN12 7NY
PH: 01144 1225 708225 • FAX: 01144 1225 793803 • EMAIL: info@wellho.net • WEB: http://www.wellho.net • SKYPE: wellho