PUBLIC   marks

PUBLIC MARKS with tag algorithms

Sponsorised links

September 2008

Disco

by greut

Disco is an open-source implementation of the Map-Reduce framework for distributed computing. As the original framework, Disco supports parallel computations over large data sets on unreliable cluster of computers.

The Disco core is written in Erlang, a functional language that is designed for building robust fault-tolerant distributed applications. Users of Disco typically write jobs in Python, which makes it possible to express even complex algorithms or data processing tasks often only in tens of lines of code. This means that you can quickly write scripts to process massive amounts of data.

Disco was started at Nokia Research Center as a lightweight framework for rapid scripting of distributed data processing tasks. This far Disco has been succesfully used, for instance, in parsing and reformatting data, data clustering, probabilistic modelling, data mining, full-text indexing, and log analysis with hundreds of gigabytes of real-world data.

Erlang + Python = complete beautifulness

July 2008

Sponsorised links

June 2008

February 2008

November 2007

October 2007

teideal glic deisbhéalach » Blog Archive » Collaborative filtering made easy

by greut & 2 others

a Python implementation of Daniel Lemire’s Weighted Slope One collaborative filtering

a simple overview of collaborative filtering

September 2007

Google Code for Educators - Google: MapReduce in a Week

by greut
his page contains a comprehensive introduction to MapReduce including lectures, reading material, and programming assignments.

July 2007

April 2007

[Xiru].org — Python Genetic Algorithms

by greut & 1 other
nice quote in the header, need to give it a look.

March 2007

February 2007

January 2007

Новые патенты Yahoo и Microsoft

by ionial
патент по извлечению и распознаванию данных в HTML

December 2006

PUBLIC TAGS related to tag algorithms

no tag

Sponsorised links