Managing dependencies with Hadoop Streaming?

Posted by beagleguy on Stack Overflow
Published on 2010-05-19T01:46:32Z

Hi all, I have a quick Hadoop Streaming question. I'm writing my mappers and reducers in Python, and they require Python packages that aren't installed by default. Do I need to install those packages on every machine in the Hadoop cluster, or is there some mechanism that serializes them and ships them to the remote machines?

thanks!
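
For illustration, here is a minimal sketch of one approach that is sometimes used for pure-Python dependencies: bundle them into a zip file, ship the zip with the job, and put it on `sys.path` at the top of the mapper so Python's standard zip import support can load them. Everything named here (`deps.zip`, `shipped_helper`, `normalize`, `map_line`) is hypothetical and made up for the example; the snippet builds the zip locally only so it can run on its own, and it is not a confirmed answer to the question.

```python
import os
import sys
import tempfile
import zipfile


def build_dependency_zip(zip_path, module_name, source):
    """Write a single pure-Python module into a zip archive (hypothetical helper)."""
    with zipfile.ZipFile(zip_path, "w") as zf:
        zf.writestr(module_name + ".py", source)


def enable_shipped_zip(zip_path):
    """Put the zip on sys.path so modules inside it become importable."""
    if zip_path not in sys.path:
        sys.path.insert(0, zip_path)


# Local stand-in for a zip that would normally be shipped alongside the job.
workdir = tempfile.mkdtemp()
deps_zip = os.path.join(workdir, "deps.zip")
build_dependency_zip(
    deps_zip,
    "shipped_helper",  # hypothetical module name
    "def normalize(word):\n    return word.strip().lower()\n",
)

# In a real mapper this call would sit at the top of the script,
# before any import that depends on the shipped packages.
enable_shipped_zip(deps_zip)

import shipped_helper  # resolved from inside deps.zip


def map_line(line):
    """Emit (word, 1) pairs, using the helper loaded from the zip."""
    return [(shipped_helper.normalize(w), 1) for w in line.split()]


if __name__ == "__main__":
    for key, count in map_line("Hello Hadoop Streaming"):
        print("%s\t%d" % (key, count))
```

On a cluster, the assumption is that the zip is shipped next to the mapper script (Hadoop Streaming's `-file` option is one way to do that) and ends up in the task's working directory, so the mapper would add the bare file name `deps.zip` to `sys.path`. Packages with compiled extension modules cannot be imported from a zip this way, so those would still need to be installed on each node or handled some other way.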
