PUBLIC   marks

PUBLIC MARKS with tag hadoop

Sponsorised links

June 2008

April 2008

February 2008

InfoQ: MapReduceとHadoopの将来について、YahooのDoug Cuttingにインタビュー

by kuroyagi (via)
"このInfoQスペシャルインタビューでは、YahooにおいてHadoopがどのように使われているか、その開発におけるチャレンジ、そしてプロジェクトの将来的な方向性についてCuttingが語ってくれています。"

TechCrunch Japanese アーカイブ » ヤフーが検索にHadoop採用、グーグルっぽく脱皮を図る

by kuroyagi
"Hadoopは大規模分散型コンピューティングを得意とするソフト。グーグルの「MapReduce(MR)」の実装と分散ファイルシステムを含むオープンソースの分散コンピューティングプラットフォームだ"

Welcome to Hadoop!

by karlcow

Hadoop is a software platform that lets one easily write and run applications that process vast amounts of data.

Sponsorised links

October 2007

Writing An Hadoop MapReduce Program In Python - Michael G. Noll

by ogrisel
Even though the Hadoop framework is written in Java, programs for Hadoop need not to be coded in Java but can also be developed in other languages like Python or C (the latter since version 0.14.1). However, the documentation and the most prominent Python example on the Hadoop home page could make you think that you must translate your Python code using Jython into a Java jar file. Obviously, this is not very convenient and can even be problematic if you depend on Python features not provided by Jython. Another issue of the Jython approach is the overhead of writing your Python program in such a way that it can interact with Hadoop - just have a look at the example in ${HADOOP_INSTALL}/src/examples/python/WordCount.py and you see what I mean. I still recommend to have at least a look at the Jython approach and maybe even at the new C MapReduce API called Pipes, it's really interesting. Having that said, the ground is prepared for the purpose of this tutorial: writing a Hadoop MapReduce program in a more Pythonic way, i.e. in a way you should be familiar with.

August 2007

April 2007

PUBLIC TAGS related to tag hadoop

no tag

Sponsorised links