public marks

PUBLIC MARKS with tag datamining

2011

Anne Kaneko's Blog: Radiation (1)

by karlcow

Borrowing some data from the source below this could be 2.1 mSv for the year to 11 March 2012. Well, the same as the UK average but well above the Japanese recommended  limit.

The Britney Spears Problem - Stream Gauges

by karlcow

Suppose a pipeline delivers an endless sequence of nonnegative integers at a steady rate of one number every t time units. We want to build a device—call it a stream gauge—that intercepts the stream and displays answers to certain questions about the numbers.

Pattern | CLiPS

by karlcow

Pattern is a web mining module for the Python programming language.

2010

Mining of Massive Datasets

by karlcow

Chapter 1 Data Mining

Chapter 2 Large-Scale File Systems and Map-Reduce

Chapter 3 Finding Similar Items

Chapter 4 Mining Data Streams

Chapter 5 Link Analysis

Chapter 6 Frequent Itemsets

Chapter 7 Clustering

Chapter 8 Advertising on the Web

Chapter 9 Recommendation Systems

copenhagen wheel project

by karlcow
use your phone to unlock and lock your bike, change gears and select how much the motor assists you. As you cycle, the wheel’s sensing unit is also capturing your effort level and information about your surroundings, including road conditions, carbon monoxide, NOx, noise, ambient temperature and relative humidity. Access this data through your phone or the web and use it to plan healthier bike routes, to achieve your exercise goals or to meet up with friends on the go. You can also share your data with friends, or with your city - anonymously if you wish – thereby contributing to a fine-grained database of environmental information from which we can all benefit.

Réseaux sociaux, analyse et data mining

by nhoizey
"Le groupe “Data Mining et Apprentissage” de la SFdS (Société française de statistique) organise cette journée d’études pour introduire le domaine de l’analyse des réseaux sociaux : problèmes posés par la fouille de données et l’estimation de modèles, nouveaux algorithmes, mise en œuvre industrielle et problématiques issues des nouvelles applications "Web 2.0". "

Tom Morris: 2010-02-22

by karlcow

karl15 [Moderator] Yesterday 07:54 PM16

Another small tracking system when you are using different computers and/or sending emails from different contexts.

It requires a burden though, sending yourself copy of your emails.

the email will contain a Received: field, that you can extract with ip address from where the mail has been sent.

You can imagine a process which put yourself in bcc and then delete the email. A bit hacky whacky but could work.

writing | ben fry » Taking the “vs.” out of Man & Machine

by karlcow

“contrary to traditional assumptions, the uniquely human faculty of reason (conscious, intelligent, rational thought) requires very little computation, but that the unconscious sensorimotor skills and instincts that we share with the animals require enormous computational resources”

2009

Textual Log Analysis using Python « Isotoma Blog

by karlcow

Now, having logs of the channel reaching many megabytes, I was curious as to the text statistics produced by this channel, who has what reading age, and how much they’ve talked in comparison to other people.

Searchable: Annotation-Driven Indexing and Searching with Lucene :: Drive-by Digressions

by karlcow

Searchable is a toolkit for Lucene that harnesses the power of annotations to specify what properties to index and how to treat them.

PhotoMaker

by karlcow

This small mashup uses YQL to combine the power of Flickr and Yahoo! Placemaker. Copy and paste some text into the textbox below and click the button — application will first geo-locate all places in the text, and then will try to find photographs, published under Creative Commons license, which were geotagged at or near found places.

interesting how Placemaker reacts differently to a text with "Rouen" and "at Rouen"

MIT Media Lab: Reality Mining

by karlcow & 1 other

Reality Mining defines the collection of machine-sensed environmental data pertaining to human social behavior. This new paradigm of data mining makes possible the modeling of conversation context, proximity sensing, and temporospatial location throughout large communities of individuals. Mobile phones (and similarly innocuous devices) are used for data collection, opening social network analysis to new methods of empirical stochastic modeling.

The original Reality Mining experiment is one of the largest mobile phone projects attempted in academia. Our research agenda takes advantage of the increasingly widespread use of mobile phones to provide insight into the dynamics of both individual and group behavior. By leveraging recent advances in machine learning we are building generative models that can be used to predict what a single user will do next, as well as model behavior of large organizations.

Official Google Blog: 30,000 new Google Apps business users at Valeo

by night.kame

This marks a significant moment for Google Apps, because Valeo has 30,000 Internet-connected employees, making this one of the largest enterprise deployments of Google Apps to date. Valeo is moving to the cloud

Bientôt, Google Suggests powered by Valeo. On n'aura même plus besoin de faire de la veille techno.

2008

wiki.dbpedia.org : About

by Spone
DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to make sophisticated queries against Wikipedia, and to link other data sets on the Web to Wikipedia data.

Logiciel statistique et datamining

by cyril38130 & 1 other
Outil de référence du chargé d'étude, Sphinx Plus² permet de déployer les enquêtes sur des supports multiples : web, téléphone, scanner, PDA. Le logiciel vous accompagnera de la création du questionnaire au dépouillement des données.

Télécharger Sphinx

by cyril38130
Telecharger gratuitement une version d'évaluation du logiciel Sphinx pour vos enquête , analyse de données, sondage, reporting...

2007

Active users

karlcow
last mark : 24/08/2011 16:27

rwatuny
last mark : 01/10/2010 13:43

nhoizey
last mark : 30/03/2010 10:57

jey
last mark : 09/09/2009 14:53

night.kame
last mark : 13/05/2009 21:39

Spone
last mark : 22/08/2008 23:28

kiad
last mark : 10/07/2008 11:31

after8
last mark : 16/03/2008 04:00

cyril38130
last mark : 28/08/2008 08:38

signalsurf
last mark : 25/11/2007 20:11

Yann_L
last mark : 18/11/2007 15:42

Regis
last mark : 16/06/2007 13:44

_Nico
last mark : 15/06/2007 10:11