public marks

PUBLIC MARKS with tag parser

This year

parcon 0.1.25 : Python Package Index

by karlcow

Parcon is a parser library. It can be used for parsing both normal text and binary data. It's designed to be easy to use and to provide informative error messages.

2011

introducing esprima: blazing-fast javascript parser | don't code today

by karlcow

In a nutshell, Esprima (esprima.org) is a JavaScript parser written in pure JavaScript.

neilkodner.com / An analysis of Steve Jobs tribute messages displayed by Apple

by karlcow
Not very interesting for the Apple mania, but worth a look for the textual analysis.

Sitepatching - YouTube, Twitter, more Hotmail

by karlcow

PATCH-444, Make Twitter hashtags visible. Twitter does some script magic and ends up with broken element nesting: <span><strong>foo</span></strong> in turn causing Opera's layout engine to be upset. This will be fixed with the new parser so we'll just patch it meanwhile.
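
The broken nesting described above can be detected mechanically. The following is a minimal sketch using Python's standard `html.parser`; it is not how Opera's sitepatching or its layout engine works. The names `NestingChecker`, `find_misnesting` and the short `VOID` set are hypothetical, chosen only for this illustration.

```python
from html.parser import HTMLParser

# Assumption: a small, incomplete set of void elements that never get a close tag.
VOID = {"br", "hr", "img", "input", "link", "meta"}


class NestingChecker(HTMLParser):
    """Track open elements on a stack and record close tags that break nesting."""

    def __init__(self):
        super().__init__()
        self.open_tags = []
        self.problems = []

    def handle_starttag(self, tag, attrs):
        if tag not in VOID:
            self.open_tags.append(tag)

    def handle_startendtag(self, tag, attrs):
        # Self-closing syntax such as <br/> opens nothing, so ignore it.
        pass

    def handle_endtag(self, tag):
        if tag in VOID:
            return
        if tag not in self.open_tags:
            self.problems.append(("stray", tag, None))
            return
        if self.open_tags[-1] != tag:
            # The close tag does not match the innermost open element.
            self.problems.append(("misnested", tag, self.open_tags[-1]))
        # Drop the matched element and everything opened inside it.
        index = len(self.open_tags) - 1 - self.open_tags[::-1].index(tag)
        del self.open_tags[index:]


def find_misnesting(markup):
    checker = NestingChecker()
    checker.feed(markup)
    checker.close()
    return checker.problems


print(find_misnesting("<span><strong>foo</span></strong>"))
```

For the markup quoted in the patch note, the checker reports that `</span>` arrives while `<strong>` is still open, and that the later `</strong>` no longer has an element to close.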

microdata.py at master from edsu/microdata - GitHub

by karlcow

Python library for extracting HTML5 microdata

the EnAKTing blog › Fast SPARQL XML Results Parser in Python

by karlcow

re-wrote our original SPARQL XML results parser to use Expat, the non-validating (and fast) XML parser.
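
As a rough illustration of the approach, here is a minimal sketch of reading SPARQL XML results with Expat through Python's standard `xml.parsers.expat` module. It is not the EnAKTing code; the function name `parse_sparql_results`, the `SAMPLE` document and the choice to return plain dictionaries are assumptions made for this example.

```python
import xml.parsers.expat

# Hypothetical sample document in the SPARQL Query Results XML Format.
SAMPLE = """<sparql xmlns="http://www.w3.org/2005/sparql-results#">
  <head><variable name="s"/><variable name="label"/></head>
  <results>
    <result>
      <binding name="s"><uri>http://example.org/a</uri></binding>
      <binding name="label"><literal>Alpha</literal></binding>
    </result>
    <result>
      <binding name="s"><uri>http://example.org/b</uri></binding>
    </result>
  </results>
</sparql>"""


def parse_sparql_results(xml_text):
    """Return one {variable: value} dict per <result> element."""
    rows = []
    state = {"row": None, "var": None, "chunks": None}

    def local(name):
        # With namespace_separator=" ", names arrive as "namespace-uri localname".
        return name.rsplit(" ", 1)[-1]

    def start(name, attrs):
        tag = local(name)
        if tag == "result":
            state["row"] = {}
        elif tag == "binding" and state["row"] is not None:
            state["var"] = attrs.get("name")
        elif tag in ("uri", "literal", "bnode") and state["var"] is not None:
            state["chunks"] = []

    def end(name):
        tag = local(name)
        if tag in ("uri", "literal", "bnode") and state["chunks"] is not None:
            state["row"][state["var"]] = "".join(state["chunks"])
            state["chunks"] = None
        elif tag == "binding":
            state["var"] = None
        elif tag == "result" and state["row"] is not None:
            rows.append(state["row"])
            state["row"] = None

    def chars(data):
        # Expat may deliver text in several pieces, so collect them.
        if state["chunks"] is not None:
            state["chunks"].append(data)

    parser = xml.parsers.expat.ParserCreate(namespace_separator=" ")
    parser.StartElementHandler = start
    parser.EndElementHandler = end
    parser.CharacterDataHandler = chars
    parser.Parse(xml_text, True)
    return rows


for row in parse_sparql_results(SAMPLE):
    print(row)
```

The sketch keeps only the binding values and discards datatype and language attributes, which a fuller parser would probably want to preserve.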

2010

Create your own textformat and parse it

by karlcow

implemented the parser as a simple state machine with no syntax tolerance
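
To show what a strict state-machine parser can look like, here is a minimal sketch. The bookmarked article's own text format is not reproduced here, so the `key=value;` format, the `State` enum and the `parse_pairs` function are all hypothetical. "No syntax tolerance" is modelled by raising an error on the first unexpected character.

```python
from enum import Enum, auto


class State(Enum):
    KEY = auto()
    VALUE = auto()


def parse_pairs(text):
    """Parse 'key=value;' pairs (an assumed toy format), rejecting anything else."""
    state = State.KEY
    key, value = [], []
    pairs = {}
    for pos, ch in enumerate(text):
        if state is State.KEY:
            if ch.isalnum():
                key.append(ch)
            elif ch == "=" and key:
                state = State.VALUE
            else:
                raise ValueError(f"unexpected {ch!r} at position {pos} in key")
        else:  # State.VALUE
            if ch == ";":
                pairs["".join(key)] = "".join(value)
                key, value = [], []
                state = State.KEY
            elif ch == "=":
                raise ValueError(f"unexpected '=' at position {pos} in value")
            else:
                value.append(ch)
    # Strict mode: the input must end exactly on a pair boundary.
    if state is State.VALUE or key:
        raise ValueError("input ended inside an unterminated pair")
    return pairs


print(parse_pairs("a=1;b=two;"))
```

Each character either advances the current state or fails immediately, so malformed input is never silently repaired.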

Sam Ruby: Scoping out a C++ HTML5 parser

by karlcow

As someone who attempted to keep an implementation of an HTML5 parser up-to-date for some period of time last year, I will say it’s time-consuming, thankless work, especially since the spec was changing a lot. Now that things have settled down a bit, it might be good to start making full implementations.

jabapyth's css at master - GitHub

by karlcow

A Python parser for CSS, powered by codetalker.

php-excel-reader - Project Hosting on Google Code

by Spone & 2 others
This PHP library expands on the great work done in the PHP Excel Reader project on SourceForge. It reads the binary format of XLS files directly and can return values and formats from any cell.

Extractomatic

by karlcow, 2 comments

Extractomatic is a simple API to detect and remove surplus clutter (such as adverts, headers, footers) around the main content of a web page. It uses the Boilerplate Java library, by Christian Kohlschütter.

jParser and jTokenizer released | Web 2.1

by Spone
After nearly two years I've finally gotten around to releasing my PHP JavaScript parser, although documentation is still thin on the ground.

2009

enriquepablo / nl / wiki / Home — bitbucket.org

by karlcow

nl is a Python library that exposes a declarative API for building sentences and rules. These are used as input for a knowledge base built on the CLIPS production system. CLIPS builds a Rete network from the rules and sentences, which can then be queried efficiently for their consequences.

The main claim of nl is to offer a syntax that can accommodate any coherent theory that we may build with the natural language (in the same sense as something like the semantic web's OWL-Full would), while at the same time being based on a simple finite domain first order theory. This theory is NL, a discussion of which can be found here. This discussion is probably required reading to understand the breadth and the limits of nl, but not to start using it.

ry's http-parser at master - GitHub

by karlcow

This is a parser for HTTP messages written in C. It parses both requests and responses. The parser is designed to be used in high-performance HTTP applications. It does not make any allocations, it does not buffer data, and it can be interrupted at any time. It only requires about 128 bytes of data per message stream (in a web server, that is per connection).
