» Map-reduce for dummies / Reply to comment

Reply to comment


Map-reduce for dummies

Posted on 31 January 2007 - 11:24

While browsing around, I incidentally found an article by Joel Spolsky where he introduces very simply and progressively the principles of the map-reduce pattern that underlies a big part of Google's infrastructure. A must read!

We use map-reduce at Joost to process usage data (roughly equivalent to the log files on a web server) and extract lots of useful information about the usage of the platform. This is built on Apache Hadoop, an open source implementation of map-reduce.

Considering the still limited numbers of users we have, map-reduce isn't absolutely necessary and a SQL database could have done the trick, but with the expected huge user base when Joost becomes generally available, having a solution that is able to scale mostly by throwing in more machines is a must have.

Reply

The content of this field is kept private and will not be shown publicly. If you have a Gravatar account, used to display your avatar.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
  • Web page addresses and e-mail addresses turn into links automatically.

More information about formatting options

CAPTCHA
Please answer this question to show that you're not a stupid spam robot.
6 + 0 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.

J'ai Failli Attendre