Hadoop for Machine Learning Guide

Saved by Alex on March 12, 2009 at 6:05:09 pm

Hadoop (hadoop.apache.org/core/) is a tool that makes it easy to run programs on clusters.  It uses the Map-Reduce framework: a map step runs a computation on individual records (such as data points) in parallel across the cluster, and a reduce step then combines the results of that computation.  The tutorial at hadoop.apache.org/core/docs/current/mapred_tutorial.html covers the basics of Hadoop operation.
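
To make the map and reduce steps concrete, here is a minimal sketch in plain Python that imitates the flow inside a single process. It does not use Hadoop or its API. The function names (map_record, reduce_values, run_map_reduce) and the toy data set are assumptions made up for illustration. Each record is a (label, value) data point, and the job computes the mean value per label.

```python
from collections import defaultdict


def map_record(record):
    """Map step: emit (key, value) pairs for a single record."""
    label, x = record
    # Emit a partial (sum, count) pair so the reduce step can combine them.
    yield label, (x, 1)


def reduce_values(key, values):
    """Reduce step: combine every value emitted for one key into a mean."""
    total = 0.0
    count = 0
    for partial_sum, partial_count in values:
        total += partial_sum
        count += partial_count
    return key, total / count


def run_map_reduce(records, mapper, reducer):
    """Run the map, group-by-key, and reduce phases locally (no cluster)."""
    grouped = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            grouped[key].append(value)
    return dict(reducer(key, values) for key, values in sorted(grouped.items()))


# Hypothetical toy data: (class label, feature value) pairs.
data = [("a", 1.0), ("b", 4.0), ("a", 3.0), ("b", 6.0)]
means = run_map_reduce(data, map_record, reduce_values)
print(means)
```

On a real cluster the framework, not a local loop, would spread the map calls over many machines and route each key's values to a reducer. The sketch only shows how the work is split between the two steps.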


