Sponsorised links
This year
2007
Mastering Oracle Python, Part 3: Data Parsing
There are countless reasons for parsing data, as well as tools and techniques to do it. But even the "right" tool may be insufficient when you need to do something new with the data. The same concerns exist for the integration of heterogeneous data sources. Sooner or later, the right tool for the right job happens to be a programming language.
Oracle offers some very powerful utilities for loading, processing, and unloading data. SQL*Loader, Data Pump, external tables, Oracle Text, regular expressions—it's all there. Yet there is often a need to do things outside the database (or, trivially, perhaps you just weren't granted the necessary database privileges).
Python delivers possibilities for efficient data parsing at a high level. The extensive standard library and many modules available for free on the Internet make it possible to work with data logic rather than dissecting bytes by hand.
Sponsorised links
2006
pyconstruct »
Construct is a python library for parsing and building of arbitrary data structures. It works by defining complex constructs using more primitive ones, in a hierarchy. This declarative structuring of components makes it very flexible and strong. It's the first library that makes parsing fun, instead of the usual headache it is today.
Perl.com: Lexing Your Data
Most of us have tried at one time or another to use regular expressions to do things we shouldn't: parsing HTML, obfuscating code, washing dishes, etc. This is what the technical term "showing off" means.
2005
HANDY ONE-LINERS FOR SED
The essential, official compendium of useful sed one-liners. Organised into sections by usage, such as file spacing, line numbering, selective line removal/deletion and optimisation. Updated 10 April 1998.
VObject Home
Vobject is intended to be a full featured Python package for parsing and generating vCard and vCalendar files.
URL Parsing With WSGI And Paste
This document is intended for web framework authors and integrators, and people who want to understand the internal architecture of Paste.
