Does changing the default HDFS replication factor from 3 affect mapper performance?

Posted by liamf on Server Fault
Published on 2011-06-29T15:57:24Z Indexed on 2011/06/29 16:24 UTC

I have an HDFS/Hadoop cluster set up and am looking into tuning it.

I wonder whether raising the HDFS replication factor (default: 3) would improve mapper performance, at the obvious expense of more disk space used.

My reasoning is that if the data is already replicated to more nodes, map tasks can run on more nodes in parallel without any data having to be streamed or copied to them.
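To put rough numbers on that reasoning, here is a minimal Python sketch of the trade-off. It rests on simplifying assumptions that are mine, not Hadoop's actual behaviour: replicas are treated as placed uniformly at random on distinct nodes (real HDFS placement is rack-aware), and the scheduler is ignored entirely. The function names `p_local` and `raw_storage_tb`, the 20-node cluster, and the 10 TB data set are all hypothetical, chosen only for illustration.

```python
from math import comb


def p_local(nodes: int, replication: int, free_nodes: int) -> float:
    """Chance that at least one of `free_nodes` idle nodes holds a replica
    of a given block, assuming replicas sit on `replication` distinct nodes
    chosen uniformly at random (an assumption, not real HDFS placement)."""
    if not 0 < replication <= nodes:
        raise ValueError("replication must be between 1 and the node count")
    if not 0 <= free_nodes <= nodes:
        raise ValueError("free_nodes must be between 0 and the node count")
    # Probability that every idle node is one of the nodes WITHOUT a replica.
    # math.comb(n, k) returns 0 when k > n, which covers the case where
    # there are more idle nodes than replica-free nodes.
    p_none = comb(nodes - replication, free_nodes) / comb(nodes, free_nodes)
    return 1 - p_none


def raw_storage_tb(data_tb: float, replication: int) -> float:
    """Raw disk consumed by `data_tb` of logical data at a replication factor."""
    return data_tb * replication


if __name__ == "__main__":
    # Hypothetical cluster: 20 nodes, 2 of them idle, 10 TB of logical data.
    for r in (3, 5, 10):
        print(
            f"replication={r:2d}  "
            f"P(local)={p_local(20, r, 2):.2f}  "
            f"raw storage={raw_storage_tb(10, r):.0f} TB"
        )
```

Under this toy model the chance of finding a data-local slot rises with the replication factor, while raw disk usage grows linearly. Whether that translates into faster map tasks on a real cluster is exactly the open question.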

Anyone got any opinions?

© Server Fault or respective owner
