How can I implement MapReduce using shell commands?

Posted by alex on Stack Overflow
Published on 2010-04-16T16:28:12Z Indexed on 2010/04/17 13:13 UTC

How do you execute a Unix shell command (e.g. an awk one-liner) on a cluster in parallel (step 1) and then collect the results back on a central node (step 2)?

Update: I've just found a script that seems to do exactly what I need: http://blog.last.fm/2009/04/06/mapreduce-bash-script
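
For illustration, the two steps in the question could be sketched in plain shell roughly as follows. This is a minimal sketch, not the linked script's method: it assumes passwordless ssh to each node, and the helper names (`run_on_node`, `map_step`, `reduce_step`), the host names, and the "key count" output format are all hypothetical.

```shell
# Hypothetical helper: run one command string on one node.
# Assumes passwordless ssh; replace the body to use another transport.
run_on_node() {
    node=$1
    shift
    ssh -o BatchMode=yes "$node" "$@"
}

# Step 1 (map): run the same command on every host in parallel,
# writing each node's output to its own file under $outdir.
map_step() {
    outdir=$1
    cmd=$2
    shift 2
    for host in "$@"; do
        run_on_node "$host" "$cmd" > "$outdir/$host.out" &
    done
    wait   # block until every background job has finished
}

# Step 2 (reduce): merge the per-node "key count" lines on the
# central node and sum the counts for each key.
reduce_step() {
    cat "$1"/*.out |
        awk '{ sum[$1] += $2 } END { for (k in sum) print k, sum[k] }' |
        sort
}
```

A call might look like `map_step "$(mktemp -d)" "awk '{ print \$1, 1 }' /data/part.log" node1 node2 node3` followed by `reduce_step` on the same directory, where the node names and the log path are placeholders. A failure on one node goes unnoticed here, since a bare `wait` does not report the exit status of individual jobs, so real use would want per-job status checks.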

© Stack Overflow or respective owner
